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OM nucleic - nucleic search, using sw model 
Run on: 



December 13, 2003, 04:37:24 ; Search time 1755.93 Seconds 

(without alignments) 
7460.491 Million cell updates/sec 



Title: US- 09 -852 -261-3 

Perfect score : 539 

Sequence: 1 ggaccagagaccctttgcgg agtaaacattcccggaattc 53 9 

Scoring table: IDENTITY_NUC' 

Gapop 10.0 , Gapext 1.0 

Searched: 22781392 seqs, 12152238056 residues 

Total number of hits satisfying chosen parameters: 45562784 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing : Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : 



EST: * 
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em_estba : * 
em_esthum: * 
em_estin: * 
em_estmu : * 
em_estov : * 
em_estpl : * 
em_estro : * 
em_htc : * 
gb_estl : * 
gb_est2:* 
gb_htc: * 
gb_est3 : * 
gb_est4 : * 
gb_est5 : * 
em_estfun : * 
em_estom: * 
em_gs s_hum : * 
em_gss_inv: * 
em_gss_pln: * 
em_gss_vrt : * 
ern_gss_fun: * 
em_gss_mam: * 
em_gss_jnus : * 
em_gss_pro : * 
em_gss__rod: * 
em_gss_phg: * 
em_gss vrl : * 



28: gb_gssl:* 
29: gb_gss2:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 

AI169253/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



Quackenbush , J . , 
(REST) Catalog & Rat 



BASE COUNT 
ORIGIN 



AI169253 549 bp mRNA linear EST 08-JAN-1999 

EST215088 Normalized rat kidney, Bento Soares Rattus sp. cDNA clone 
RKIBP33 3* end, mRNA sequence. 
AI169253 

AI169253.1 GI:4134375 
EST . 

Rattus sp. 
Rattus sp. 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria ; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

1 (bases 1 to 549) 

Lee,N.H., Glodek,A., Chandra, I., Mason, T.M., 
Kerlavage,A.R. and Adams, M.D. 
Rat Genome Project: Generation of a Rat EST 
Gene Index 
Unpublished 

On Oct 6, 1998 this sequence version replaced gi : 3705561. 
Other_ESTs: TC50779. 
Contact: Lee, NH 

The Institute for Genomic Research 

9712, Medical Center Drive, Rockville, MD 20850, USA 
Tel: (301) -838-3529 
Fax: (301) -838-0208 
Email: nhlee@tigr.org 
Seq primer: M13-21. 

Location/Qualifiers 
1. .549 

/organism="Rattus sp." 
/mol_type-"mRNA" 
/db_xref="taxon: 10118" 
/clone="RKIBP33" 

/clone__lib= "Normalized rat kidney, Bento Soares" 
/not e=" Organ: kidney; Vector: pT7T3Pac; Site_l: EcoRI ; 
Site_2: Not 1 11 
112 a 140 c 133 g 164 t 



Query Match 93.1%; Score 501.6; DB 9 

Best Local Similarity 97.3%; Pred. No. 3.7e-107 
Matches 510 ; Conservative 0 ; Mismatches 14 



Length 54 9; 

Indels 0; Gaps 0; 



Qy 



Db 



Qy 



Db 



8 AGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCAAGGGGCT 67 

1 1 1 1 1 ! 1 1 1! I II I li I II I II III 1 1 1 II 1 1 1 1 II I II II 1 1 1 1 II II 1 1 1 1 II 

54 9 AGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAATTCGTGTGTGGACCAAGGGGCT 4 90 

68 TTTACTTCAACAAGCCCAC^GTCTA 127 

IIIIIIIIIIIIIIIIIIMI I . 1 1 ■ I i ' 

48 9 TTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGA 430 



Qy 12 8 TTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTACTGTGTCC 187 

IIIIIIIIMIIIMIIIIIIIMIIIIIMMIIIIIIIIIIIMIIIIIIIIII I 

Db 429 TTGTGGATGAGTGTAGCTT CCGGAG CTGTGATCTGAGGAGG CTGGAGATGTACTGTG CTC 370 

Qy 188 GCTGCAAGCCTACAAAGTCAGCTCGTTCC^TCCGGGCCCAGCGCCACACTGAC^TGCC^ 247 

1 1 1 1 1 1 li 1 1! 1 1 1 Ml 1 1 1 II I II I II 1 1 1 1 1 1 MM I II I 1 1 Ml II I III 

Db 369 CGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACATTGACATGCCCA 310 

Qy 24 8 AGACTCAGAAGT CC CAG CCC CTAT CGA CACA CAAGAAAAGGAAG CTG CAAAGGAGAAGGA 3 07 

1 1 1 1 1 M N 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

Db 3 09 AGACT CAGAAGTC CCAG CC CCTATCGACACACAAGAAAAGGAAG CTGCAAAGGAGAAGGA 250 

Qy 308 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGC^ 3 67 

MMMMMMIMIMMMIMIMMMMMMMIIMMIMI MINIMI 

Db 24 9 AAGGAAGTAC^CTTGAAGAACACAAGTAGAGGAAGTGC^GGAAACAAGACTTAaVGAATG 190 

Qy 368 TAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTGCTGCTTG 427 

I II II M IM II Ml Mill I M I III 1 1 Ml M I 1 1 Ml M M 1 1 1 1 M 1 1 1 1 1 1 1 1 

Db 18 9 TAGGAGGAG CCT C CCGAGGAACAGAAAATGCCACGT CAC CG CAAGATCCTTTG CTG CTTG 13 0 

Qy 428 AGCAACCTGCAAAACATCGGAACACCTGCCAAATATC 487 

1 1 1 M I M I M M M I M M 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 IM M II 1 1 1 1 II M MMM 

Db 12 9 AG CAAC CTG CAAAACAT CGGAACAC CTG CCAAATATCAATAATGAGTTCAATA C CATTT C 7 0 

Qy 488 AGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

Db 69 AGAGATGGG CATTT C CCTCAATGAAATACACAAGTAAACATTCC 2 6 



RESULT 2 
CD373004 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



614 bp 



mRNA 



linear EST 29-MAY-2003 
UI-R-GR0 Rattus norvegicus cDNA clone 
mRNA sequence. 



CD373004 

UI -R-GR0 -csv- j -17-0 -UI .rl 
UI-R-GRO-csv-j -17-0-UI 5' 
CD373004 

CD373004.1 GI:31157094 
EST. 

Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

1 (bases 1 to 614) 

Bonaldo,M.F. , Lennon,G. and Soares,M.B. 

Normalization and subtraction: two approaches to facilitate gene 
discovery 

Genome Res. 6 (9), 791-806 (1996) 

97044477 

8889548 

Contact: Soares, MB 

Coordinated Laboratory for Computational Genomics 
University of Iowa 

375 Newton Road , 4156 MEBRF, Iowa City, IA 52242, USA 

Tel: 319 335 8250 

Fax: .319 335 9565 

Email: bento-soares@uiowa.edu 

Tissue Procurement: James Lin, University of Iowa 



cDNA Library preparation: Dr.- M . Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M . Bento Soares, University of Iowa 
Clone Distribution: Distribution information can be found at 
http: //genome .uiowa . edu/distribut ion/rat .html 
Seq primer: M13. REVERSE . 
FEATURES Location/Qualifiers 
source 1. .614 

/organism="Rattus norvegicus" 

/mol_type-"mRNA" 

/ s t ra in= " Sprague - Dawl ey " 

/db_xref="taxon: 10116" 

/clone="UI-R-GRO-csv-j -17-0-UI" 

/tissue_type=" Whole embryo" 

/dev_stage=" embryo 13dpc" 

/lab_host-"DH10B (Life Technologies) (Tl phage resistant)" 
/clone_lib="UI-R-GRO" 

/note="Vector: pYX-Asc; Site_l : EcoR I; Site__2: Not I; 
UI-R-GRO is a cDNA library containing the following 
tissue (s) : rat whole embryo 13dpc. The library was 
constructed according to Bonaldo, Lennon and Soares, 
Genome Research, 6: 791-806 , 1996. Denatured RNA was size 
fractionated on a 1% agarose gel. First strand cDNA 
synthesis was primed with oligo-dT primer containing a Not 
I site. Double strand cDNA was size selected according to 
mRNA size fraction, ligated with EcoR I adaptor, digested 
with NotI and then cloned directionally into pYX-Asc 
vector. The library tag sequence located between the Not I 
site and the polyA tail is CATCTCTACT. This library was 
created for the University of Iowa Program for Rat Gene 
Discovery and Mapping (Val Sheffield, Bento Soares and Tom 
Ca savant) . " 

BASE COUNT 171 a 168 c 154 g 119 t 2 others 

ORIGIN 



Query Match 89.1%; Score 480.4; DB 14; Length 614; 

Best Local Similarity 97.6%; Pred. No. 3.5e-102; 

Matches 487; Conservative 0; Mismatches 12; Indels 0; Gaps 0; 
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1 M 1 1 Ml 1 1 1 IM Ml 1 1 1 1 M M 1 1 II 1 1 1 II IM 1 1 1 1 1 1 1 1 Ml 1 1 

GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 
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AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGG 
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Db 


176 


Ml 1 II 1 1 1 ill 1 1 ll 1 Ml II II M Mllllllllllll MMIMMIM 

AGGGGCTTTTACTT(^(^GCCCACAGGCTATGGCTCCAGC^TTCGGAGGGCACCACAG 


235 


Qy 


121 


ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 

II MINI Mllllllllllll III llllll IIIIMI 1 IIIIIIIIIMI IIIIIIMI 

ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 


180 


Db 


236 


295 


Qy 


181 


TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCC^TCCGGGCCCAGCGCCACACTGAC 


240 


Db 


296 


1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 


355 


Qy 


241 


ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACAC^ 


300 



II llllll llllll IMIIIIIIIIMIIIIMIIIII IMIMIMMIMMIMI 



Db 



356 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAA^ 415 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 



3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

IIIIIIIIIIIIMMIIIIIIIIIIIIIMIIIIIIII lllllllllllll I II III II 

416 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 475 
361 C^GAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

■ . IIIIIIIIMIII lllllllll 

476 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATTCCACGTCACCGCATGATCCTTTG 53 5 
421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCC^^ 480 

IIIIIIIIIIMIIINI IIIIIIIIIIIIMirilMIIIIIIIIIIIIMMIIIII 

536 CTGCTTGAGCAACCTGCANAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 595 
481 T CATTTCAGAGATGGG CAT 4 99 

IIIMIIIIMIIIIIM 

596 CCATTTCAGAGATGGGCAT 614 



RESULT 3 

AI503976/C 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



AI503976 558 bp mRNA linear EST ll-MAR-1999 

vm43d08.xl Stratagene mouse diaphragm (#937303) Mus musculus cDNA 
clone IMAGE: 1001007 3 1 similar to gb:X04482 Mouse mRNA for 
preproinsulin-like growth factor IB (MOUSE);, mRNA sequence. 
AI503976 

AI 503 976. 1 GI : 44 01827 
EST . 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 558) 

Marra,M., Hillier,L. # Kucaba,T., Martin, J . , Beck,C, Wylie,T., 

Underwood,K. , Steptoe,M., Theising,B., Allen, M. , Bowers, Y., Person 
# B. # Swaller,T., Gibbons, M., Pape,D., Harvey, N., Schurk,R., Ritter 
,E., Kohn,S., Shin,T., Jackson, Y., Cardenas, M. , McCann,R., 

Waterston,R. and Wilson, R . 

The WashU-NCI Mouse EST Project 1999 

Unpublished 

Contact: Marra M/WashU-NCI Mouse EST Project 1999 
Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email: mouseest@watson.wustl.edu 

This clone is available royalty- free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
MGI : 565223 

This clone was previously sequenced on the 5 * end only, this new 

data is from the 3 1 end 

High quality sequence stop: 44 0. 

Location/Qualifiers 

1. .558 

/organism= "Mus musculus" 

/ mo 1 _ t yp e - " mRNA " 

/db xref="taxon: 10090" 



/clone=" IMAGE: 1001007" 
/tissue_type=" diaphragm" 
/dev_stage= "adult" 

/lab_host="SOLR {kanamycin resistant) " 
/clone_lib="Stratagene mouse diaphragm (#937303)" 
/note="Organ: diaphragm; Vector: pBluescript SK- ; Site__l: 
EcoRI; Site_2: Xhol ; Cloned unidirectionally from mRNA 
prepared from diaphragm muscle. Primer: Oligo dT. Average 
insert size: 1.5 kb. Uni-ZAP XR Vector; ~5 ' adaptor 
sequence: 5 f GAATTCGGCACGAG 3' ~3 ' adaptor sequence: 5 f 
CTCGAGTTTTTTTTTTTTTTTTTT 3 1 " 

BASE COUNT 103 a 133 c .14 9 g 173 t 

ORIGIN 

Query Match 87.1%; Score 469.2; DB 9; Length 558; 

Best Local Similarity 92.8%; Pred. No. 1.5e-99; 

Matches 492; Conservative 0; Mismatches 38; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

MIMMMIIMMMMIIMMMMMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 53 0 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 471 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCA.CC^ 120 

: , i i n ' i i i iiiMiiiMiiiiiiiiiiiiiiiii mi 

Db 470 AGGGGCTTTTACTTCmO^GCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 411 

Qy 121 ACGGGGATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II I M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 MIMMIMM 

Db 410 ACAGGCATTGTGGATGAGTGTTGCTTC CGGAG CTGTGATCTGAGGAGACTGGAGATGTAC 351 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCC^ 24 0 

. 1 1 1 1 II II Ml III Ml MM II II II Ml 1 1 1 1 1 II i M I M 1 1 i 1 1 

Db 350 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 2 91 

Qy 241 ATGC CCAAGACTCAGAAGT C C CAGCCC CTATCGACACACAAGAAAAGGAAG CTG CAAAGG 300 

IMIIIIIIIIIIIIIIIIIM I IIIIIIIMM IIMIIMI 1 1 1 1 1 1 M 1 1 1 1 1 

Db 2 90 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAG CTG CAAAGG 231 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 360 

MMMMMMMIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 23 0 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 171 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAAC^GAAAATGCCACGTC^CCGt^AGATCCTTTG 42 0 

MMMMMMMMIIMM III 1 1 1 1 M 1 1 M 1 1 1 II I Mill 1 1 1 i 1 1 1 1 1 

Db 170 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 111 

Qy ^421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGA 480 

I II I MM MM II I I MUM III Mill 

Db 110 CTGCTTGAGO^CCTGCAAAACATCGAAACACCT^^ 51 

Qy 481 TCATTTCAGAGATGGG(^TTTCCCTCAATGAAATACAO^GTAAACATTC 53 0 

II II II II I I I I II II II I I I II I I I II II I M I II I II II I I I I 
Db 5 0 ACATTACAAAGATGGGCATTTCCCCCAATGAAATATACAAGTAAACATTC 1 



RESULT 4 
AW146128/C 



LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



AW146128 623 bp mRNA linear EST 10-OCT-2000 

um37el0.xl Sugano mouse embryo mewa Mus musculus cDNA clone 
IMAGE:2247498 3' similar to gb:X04482 Mouse mRNA for 
preproinsulin-like growth factor IB (MOUSE);', mRNA sequence. 
AW146128 

AW146128.1 GI:6167864 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; . Murinae; Mus . 
1 (bases 1 to 623) 

Marra,M., Hillier,L., Kucaba,T., Martin, J., Beck,C, Wylie,T., 

Underwood, K. , Steptoe,M., Theising,B., Allen, M., Bowers, Y. # Person 

,B., Swaller,T., Gibbons, M., Pape,D., Harvey, N., Schurk,R., Ritter 

, E., Kohn,S., Shin,T., Jackson, Y. , Cardenas, M. , McCann,R. , 

Waterston,R. and Wilson, R. 

The WashU-NCI Mouse EST Project 1999 

Unpublished 

Contact: Marra M/WashU-NCI Mouse EST Project 1999 
Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email: mouseest@watson.wustl.edu 

This clone is available royalty-free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
MGI: 1006958 

Seq primer: custom primer used 
High quality sequence stop: 499. 
Location/Qualifiers 
1. .623 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain-"C57BL" 
/db_xref -"taxon: 10090" 
/clone=" IMAGE: 2247498" 
/dev_stage=" embryo, 14 dpc" 
/lab_host="DH10B" 

/clone_lib=" Sugano mouse embryo mewa" 

/note= "Vector: pME18S-FL3; Site_l: Drain (CACTGTGTG) ; 
Site_2: Drain (CACCATGTG) ; 1st strand cDNA was primed 
with an oligo(dT) primer [ATGTGGCCTTTTTTTTTTTTTTTTT] ; 
double -stranded cDNA was ligated to a Drain adaptor 
[ TGTTGGCCTACTGG ] , digested and cloned into distinct Drain 
sites of the pME18S-FL3 vector (5 1 site CACTGTGTG, 3' site 
CACCATGTG) . Xhol should be used to isolate the cDNA 
insert. Size selection was performed to exclude fragments 
<1.5kb. Library constructed by Dr. Sumio Sugano 
(University of Tokyo Institute of Medical Science) . 
Custom primers for sequencing: 5' end primer 
CTTCTGCTCTAAAAGCTGCG and 3 ' end primer 
CGACCTGCAGCTCGAGCACA . " 
123 a 138 c 170 g 191 t 1 others 



Query Match 



84.1%; Score 453.4; DB 9; Length 623; 



Best Local Similarity 92.1%; Pred. No. 7.5e-96; 

Matches 489; Conservative 0; Mismatches 41; Indels 1; Gaps 



Qy 


1 


GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 


60 


Db 


541 


MMIIIIIIIIIIM IIIMIIIIIIIIIIIII IIIIMIIIIIIIIIIIIIIIII 

GGACCAGAGACCCTTTTCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 


482 


Qy 


61 


AGGGGCTTTTACTTCM.CAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 


120 


Db 


481 


II Ml MM II Ml MM II II III III II Ml II MM II MMIMIMIMI Ml 

AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 


422 


Qy 


121 


ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 


180 


Db 


421 


II MMMMIMMMMMMIMMMIMIMMIMIMM Mill MUM 

ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAAATGTAC 


362 


Qy 


181 


TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 


240 


Db 


361 


MINI IIIIM II III MM II II Mill IIIIM III III IIIIM 

TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 


302 


Qy 


241 


ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGAC^CACAAGAAAAGGAAGCTGCAAAGG 


300 


Db 


301 


Illlllllllllllllllllll 1 1 II 1 II II 1 M 1 1 1 II 1 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 

ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 


242 


Qy 


301 


AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 


360 


Db 


241 


IIIIM MMMMM II MM Mill 1 1 1 1 1 1 1 1 M 1 1 M 1 M II 1 II 1 II 1 1 II M 

AGAAGGAAAGGAAGTACATTTGAAGAACCCAAGTAGAGGAAGTGCAGGAAACAAGACCTA 


182 


Qy 


361 


CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 


420 


Db 


181 


MMIMIMM MM IIIIM II 1 II M II 1 II 1 1 1 1 II II 1 1 1 1 II II 1 M 1 1 

CAGAATGTAGGAGGAG C CTCC CACGGAGCAGAAAATG CCACAT CACCGCAGGAT CCTTTG 


122 


Qy 


421 


CTGCTTGAGCAACCTG CAAAACATCGGAACAC CTGC CAAATATCAATAATGAGTT CAATA 


480 


Db 


121 


III MMMMMMMI IIIIM II III III II Mill Mill II III Mill 

CTGCTTGAGCAACCTGCAAAACATCGAAACCCCTACCAAATAACAATAATAAGTCCAATA 


62 


Qy 


481 


TCATTTCAGAGATGGGGATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 




Db 


61 


MM II IIMIIIIIIIMM MMMMM MMMMMMMI 

ACATTACAAAGATGGGCATTTCCC - CAATGAAATATACAAGTAAACATTCC 1 2 





RESULT 5 

AI169770/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 



mRNA linear 
Bento Soares Rattus 



AI169770 468 bp 

EST215669 Normalized rat liver, 
RLIAT07 3' end, mRNA sequence. 
AI169770 

AI169770 . 1 GI :3709810 
EST. 

Rattus sp . 
Rattus sp. 

Eukaryota; Metazoa; Chordata,- 
Mammalia; Eutheria; Rodent ia; 
Rattus . 

1 (bases 1 to 468) 

Lee,N.H., Glodek,A. , Chandra, I., Mason, T.M. , 

Kerlavage,A.R. and Adams, M.D. 

Rat Genome Project: Generation of a Rat EST 



EST 20-JAN-1999 
sp. cDNA clone 



Craniata ; Vertebrata ; Eut eleostomi ; 
Sc iurognathi ; Muridae; Murinae; 



Quackenbush, J. , 
(REST) Catalog & Rat 



Gene Index 
JOURNAL Unpublished 
COMMENT Other_ESTs: TC50779 

Contact: Lee, NH 

The Institute for Genomic Research 

9712, Medical Center Drive, Rockville, MD 20850, USA 
Tel: (301) -838-3529 
Fax: (301) -838-0208 
Email: nhlee@tigr.org 
Seq primer: M13-2-1. 
FEATURES Location/Qualifiers 
source 1. .468 

/organism="Rattus sp." 
/mol_type="mRNA" 

/db_xref-"ATCC (inhost) :2027570" 
/db_xref="taxon: 10118" 
/clone="RLIAT07" 

/clone_lib= "Normalized rat liver, Bento Soares" 

/no te=" Organ: liver; Vector: pT7T3Pac; Site_l: EcoRI ; 

Site_2: Not I" 

BASE COUNT 85 a 115 c 119 g 149 t 

ORIGIN 

Query Match 83.6%; Score 450.4; DB 9; Length 468; 

Best Local Similarity 97.6%; Pred.. No. 3.6e-95; 

Matches 457; Conservative 0; Mismatches 11; Indels 0; Gaps 0; 
Qy 63 GGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGC^ 122 

IIIMIIIMIIIIIIIMIIIIMI IMIIIIIIIIIIIMIIMMIIIMIIIIII 

Db 4 68 GGGCTTTTACTTCAACAAGCCC^CAGGCTATGGCTCCAGC 4 09 

Qy 123 GGG CATTGTGGATGAGTGTTGCTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTACTG 182 

IIIIIIIIIMIIIIIIIIIIII IIIIMIIIIII MINIM II 1 1 1 1 1 i 1 1 1 1 1 1 1 

Db 4 08 GGGCATTGTGGATGAGTGTTGCTCCCGGAGCTGTGATCTGAGGAGGTTGGAGATGTACTG 34 9 

Qy 183 TGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGG 242 

II I I II I II I Ml 1 1 II M 1 1 II II 1 1 1 1 1 M 1 1 1 II I M II II M Ml II M 

Db 34 8 TGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGACAT. 28 9 

Qy 243 G C C CAAGACT CAGAAGT CCCAGCCC CTATCGACACACAAGAAAAGGAAGCTGCAAAGGAG 3 02 

I I I I 1 I I I 1 I I I I I I I I 11 I I I I I I I I I 1 I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 28 8 GCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGGAG 22 9 

Qy 303 AAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTACA 362 

1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

Db 22 8 AAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGC^ 169 

Qy 3 63 GAATGTAGGAGGAG CCTCCCGAGGAACAGAAAATGC CACGT CAC CGCAAGATCCTTTGCT 422 

1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 II 1 1 II 1 1 

Db 168 GAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTGCT 109 

Qy 4 23 GCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAAT 4 82 

I I I I I I I I I I M I II I I II II II II M I I I I I II I I I I I I II I I I I II I I I I I I M I I I 

Db 108 GCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTC^^ 4 9 

Qy 4 83 ATTTC^GAGATGGGO^TTTCCCTCAATGAAATACACAAGTAAACATTC 530 

I I Ml 1 1 1 M II II I M I II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 



Db 4 8 ATTTC^GAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 1 



RESULT 6 
AI265629/C 

LOCUS AI265629 558 bp mRNA linear EST 18-NOV-1998 

DEFINITION uj04b07.xl Sugano mouse liver mlia Mus musculus cDNA clone 
IMAGE: 1890901 3' similar to gb:X04482 Mouse mRNA for 
preproinsul in-like growth factor IB (MOUSE);, mRNA sequence. 

ACCESSION AI265629 

VERSION AI265629.1 GI:3883787 

KEYWORDS EST . 

SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
REFERENCE 1 (bases 1 to 558) 

AUTHORS Marra,M., Hillier,L., Allen, M., Bowles, M., Dietrich, N., Dubuque, T. , 
Geisel,S., Kucaba,T., Lacy,M. # Le , M . , Martin, J. , Morris, M. , 
Schellenberg,K. , Steptoe,M. , Tan,F., Underwood, K. , Moore, B. # 
Theising,B., Wylie,T., Lennon,G., Soares,B., Wilson,R. and 
Waterston, R . 
TITLE The WashU-HHMI Mouse EST Project 

JOURNAL Unpublished 
COMMENT Contact: Marra M/Mouse EST Project 

WashU-HHMI Mouse EST Project 
Washington University School of MedicineP 
4444 Forest Park Parkway,. Box 8501, St. Louis, MO 63108 
Tel: 314 286 1800 
Fax: 314 286 1810 
Email: mouseest@watson.wustl.edu 

This clone is available royalty-free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
MGI: 975225 

Seq primer: custom primer used 
High quality sequence stop: 495. 
FEATURES Location/Qualifiers 
source 1. .558 

/ organ ism=" Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL" 

/db_xref ="taxon: 10090" 

/clone=" IMAGE: 18 90901" 

/sex=" female" 

/dev_stage= "adul t " 

/lab_host="DH10B" 

/clone_lib=" Sugano mouse liver mlia" 

/note="Organ: liver; Vector: pME18S-FL3; Site_l: Dralll 
(CACTGTGTG) ; Site_2 : Dralll (CACCATGTG) ; 1st strand cDNA 
was primed with an oligo(dT) primer 

. [ATGTGGCCTTTTTTTTTTTTTTTTT] ; double -stranded cDNA was 
ligated to a Drain adaptor [TGTTGGCCTACTGG] , digested 
and cloned into distinct Dralll sites of the pME18S-FL3 
vector (5' site CACTGTGTG, 3' site CACCATGTG). Xhol should 
be used to isolate the cDNA insert. Size selection was 
performed to exclude fragments <1.5kb. Library 
constructed by Dr. Sumio Sugano (University of Tokyo 



BASE COUNT 
ORIGIN 



Institute of Medical Science) . Custom primers for 
sequencing: 5' end primer CTTCTGCTCTAAAAGCTGCG and 3' end 
primer CGACCTGCAGCTCGAGCACA . " 
106 a 135 c 156 g 161 t 



Query Match 82.9%; Score 446.8; DB 9; Length 558 ; 

Best Local Similarity 92.7%; Pred . No. 2.6e-94; 

Matches 469; Conservative 0; Mismatches 37; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

lllllll IMIMIIMIIIIIIIIIIIIIIilll IIMIIIMIIIIIIIIIIMII 

Db 506 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 44 7 



Qy 

Db 



61 AGGGGCTTTTACTTCAACAAGCCCAC^ 120 

i ' i ■ i 1 i , i i i i i immmmimmmmmmimm in 

44 6 AGGGGCTTTTACTTCAACAAGCCCAC^GGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 387 



Qy 

Db 

Qy 
Db 

Qy 

Db 
Qy 

Db 



121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II MM II I II MM Ml II III Mill II I MM 1 1 MUM I II Mill III II II 

38 6 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 327 
181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTC(^TCCGGGCCCAGCGCCACACTGAC 24 0 

INI II Ml I II I MM MM M II Mill lllllllllll lllllll 

326 TGTGCCCC^CTGAAGCCTACAAAAGCAGCCCGCTCT^ 2 67 

241 ATG CC CAAGACT CAGAAGTCC CAG CCC CTAT CGACACACAAGAAAAGGAAG CTG CAAAGG 300 

1 1 1 1 1 1 II II I II I II 1 1 M II I Mill MUM MIIIIMI 

266 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTC 2 07 

301 AGAAGGAAAGGAAGTACACTTGAAGAAC^CAAGTAGAGGAAGTGCAGGAAACAAGACCTA 3 60 

I I I I I I Mill lllllllllll II MIMIMMMMIMIIMM 

2 06 AGAAGGAAAGGAAGTACATTTGAAGAACAC^GTAGAGGAAGTGC^GGAAACAAGACCTA 147 



Qy 

Db 

Qy 
Db 



361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

111111,1 Ml II II lllllllll IMIIIII 1:11 

14 6 CAGAATGTAGGAGGAGCCTCCCA.CGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 8 7 

421 CTGCTTGAGCAACCTG CAAAACATOGGAACACCTG C CAAATAT CAATAATGAGTTCAATA 480 

I I I I I I ! I I I I lllllll MMIM lllllll III Mill 

86 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAA^ 27 



Qy 



Db 



481 T CATTT CAGAGATGGG CATTTC CCTC 506 

MM II lllllll IMIIIII I 

26 ACATTACAAAGATGGGCATTTCCCCC 1 



RESULT 7 

BQ200567/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



BQ200567 

UI-R-DZl-cne-a-18 
UI-R-DZl-cne-a-18 
BQ200567 

BQ2 00567.1 GI : 2 0417 032 
EST. 

Rattus norvegicus (Norway rat) 
Rattus norvegicus 



653 bp mRNA linear EST 02-MAY-2 002 
0-UI.sl UI-R-DZ1 Rattus norvegicus cDNA clone 
0-UI 3', mRNA sequence. 



Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; 
Rattus. 

REFERENCE 1 (bases 1 to 653) 

AUTHORS Bonaldo,M.F. , Lennon,G. and Soares,M.B. 

TITLE Normalization and subtraction: two approaches to facilitate gene 
discovery 

JOURNAL Genome Res. 6 (9), 791-806 (1996) 
MEDLINE 97044477 
PUBMED 8889548 
COMMENT Contact : Soares , MB 

Coordinated Laboratory for Computational Genomics 
University of Iowa 

375 Newton Road , 4156 MEBRF , Iowa City, IA 52242, USA 

Tel: 319 335 8250 

Fax: 319 335 9565 

Email : bento-soares@uiowa . edu 

The sequence contained an oligo-dT track that was present in the 
oligonucleotide that was used to prime the synthesis of first 
strand cDNA and therefore this may represent a bonaf ide poly A 
tail. The sequence tag present in the cDNA between the NotI site 
and the oligo-dT track served to verify it as a clone from the 
normalized cartilaginous tumor library cDNA Library Preparation: 
M.B. Soares Lab Clone distribution: clones will be available 
through Research Genetics (www.resgen.com) The following repetitive 
elements were found in this cDNA sequence: 1-43, 
>AT_rich#Low_complexity 118-164, >POLY__A#Simple_repeat 
Seq primer: M13 Forward 
POLYA^Yes . 

FEATURES Location/Qualifiers 
source 1. .653 

/organism="Rattus norvegicus" 

/mol_type="mRNA" 

/ s t r a i n= " Sprague - Da wl ey " 

/db_xref="taxon: 10116" 

/clone="UI-R-DZl-cne-a-18-0-UI" 

/t is sue__type=" Chondrosarcoma" 

/dev_stage="37 days" 

/lab_host="DH10B (Life Technologies) " 
/clone_l ib= "UI -R-DZ1 " 

/note= M 0rgan: Spine; Vector: pT7T3D-Pac (Pharmacia) with a 
modified polylinker; Site_l: Not I; Site_2 : Eco RI ; 
UI-R-DZ1 is a normalized cDNA library containing the 
following tissue(s): Swarm Rat Chondrosarcoma. The library 
was constructed according to Bonaldo, Lennon and Soares, 
Genome Research, 6:791-806, 1996. First strand cDNA 
synthesis was primed with an oligo-dT primer containing a 
Not I site. Double stranded cDNA was ligated to an EcoR I 
adaptor, digested with Not I, and cloned directionally 
into pT7T3-Pac vector. The oligonucleotide used to prime 
the synthesis of first-strand cDNA contains a library tag 
sequence that is located between the Not I site and the 
(dT) 18 tail. The sequence tag for this library is 
CATTCTTGTA . The Rat cartilaginous tumor tissue was 
provided by Dr Jeff Stevens at the University of Iowa. 
TAG_LIB=UI-R-DZ1 
TAG_TISSUE=cartilaginous tumor 



BASE COUNT 
ORIGIN 



TAG__S EQ = CATTCTTGTA " 
134 a 137 c 137 g 245 t 



Query Match 81.1%; Score 437; DB 13; Length 653; 

Best Local Similarity 97.6%; Pred. No. 5.3e-92; 

Matches 454; Conservative 0; Mismatches 10; Indels 1; Gaps 



Qy 



Db 



1; 



67 TTTTACTTCAACAAGCC(^CAGTCTATGGCTCCAGCATTCGGAGGGCACCACAGACGGGC 126 

III Mill Illlllllllllll 1 1 1 INI 1 1 1 1 II I II 1 1 1 1 1 1 Illlllllllllll 

653 TTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCACAGACGGGC 594 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



127 ATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTACTGTGTC 186 

Illlllllllllll IMIIIMIIIMIIIIMIIMIIIIIMIIIIIIIIIIIII 

593 ATTGTGGATGAGTG -TGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTACTGTGCT 535 
187 CGCTGCAAGCCTACAAAGTCAGCTCGTTCCA^ 246 

I immimmmmmiiimmmimmmmmmimmmimiimm 

534 CCGCTGAAGCCTACAAAGTCAGCTCGTTCC^TCCGGGCCC^GCGCCACACTGAC^TGCCC 4 75 

247 AAGACT CAGAAGTCC CAGCCCCTATCGACACACAAGAAAAGGAAG CTG CAAAGGAGAAGG 306 

IIMIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 
474 AAGACTCAGAAGTC CCAG CC C CTATCGACACACAAGAAAAGGAAGCTGCAAAGGAGAAGG 415 

307 AAAGGAAG TA CA CTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGA C CTA CAGAAT 366 

• IIIIIIMilllllllllllllllMMIIIIIIIIIIIIIIIIIMIIIIMIIMIM 

414 AAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCT^ 355 
367 GTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTGCTGCTT 426 

■ HIM 

354 GTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTGTTGCTT 2 95 
427 GAGCAACCTGCAAAACATCGGAACACCTGCCAAATATQ^ 48 6 

MIMIMIMMMMIMMMMMIMMIMM Ill Mill III Mill Mill 

2 94 GAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAAT 235 



Qy 



Db 



4 87 CAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

I I I I !l I II: I I I I I I MM I I I I I I I I II I : I I ! I I 

234 CAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 190 



RESULT 8 
BF383724 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



BF383724 594 bp mRNA linear EST 27-NOV-2000 

602044632F1 NCI_CGAP_Li9 Mus musculus cDNA clone IMAGE : 4194295 5', 
mRNA sequence. 
BF383724 

BF383724.1 GI: 11365029 
EST . 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata ; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 594) 
NIH-MGC http: //mgc.nci .nih.gov/ . 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 



COMMENT Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 
Tissue Procurement: Jeffrey E. Green, M.D. 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
ht tp : / / image . 1 lnl . gov 
Plate: LLAM9527 row: p column: 08 
High quality sequence stop: 58 9. 
FEATURES Location/Qualifiers 
source 1- .594 

/organism="Mus musculus" 
/mol_type^"mRNA H 
/strain="FVB/N" 
/db_xref-"taxon: 10090" 
/clone=" IMAGE: 41942 95" 

/lab_host="DH10B (Tl phage-resistant) " 
/clone_lib="NCI_CGAP_Li9" 

/note= M 0rgan: liver; Vector: pCMV-SP0RT6; Site_l: NotI; 

Site__2: Sail; Cloned unidirectionally. Primer: Oligo dT. 

Average insert size 1.9 kb. Constructed by Life 

Technologies. Note: this is a NCI_CGAP Library." 
BASE COUNT 175 a 162 c 142 g 115 t 

ORIGIN 

Query Match 80.1%; Score 431.8; DB 10; Length 594; 

Best Local Similarity 92.4%; Pred. No. 8.5e-91; 

Matches 465; Conservative 0; Mismatches 37; Indels 1; Gaps 1; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

lllllllllllll IIIIIIIMMIMIIIIIM Ml III Mill 1 1 Ml! II MM 

Db 93 GGACCAGAGACCC - TTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 151 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCT^ 12 0 

1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIIIIIIMIIIIIIIIIIIIII III 

Db 152 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 211 

Qy 121 ACGGG CATTGTGGATGAGTGTTGCTT C CGGAGCTGTGATCTGAGGAGG CTGGAGATGTAC 18 0 

II 1 1 1 1 II II 1 1 1 M 1 1 1 1 1 1 1 M I M I M M II I M 1 1 1 1 II II I M M M 1 1 II 1 1 

Db 212 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 271 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

MM M I MM 1 1 MM MM II II Mill 1 1 1 1 1 1 II 1 1 1 1 II II 1 1 

Db 272 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 331 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACA 300 

MMMMMMIMIMIMI I Ml 1 1 Ml III II I II MM I MM I Ml II II 

Db 332 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 391 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

Mill lllllllllllll MIIIMMIIIIIIMIMMI Mill III Mill Mill 

Db 3 92 AGAAGGAAAGGAAGTACATTTGAAGAAC^(^GTAGAGGAAGTG(^GGAAACAAGACCTA 451 

Qy 3 61 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

1 1 1 1 1 1 M 1 1 1 1 M I II I II 1 1 III MINIUM I MUM lllllllll 



Db 


452 


CAGAATGTAGGAGGAGC CTCCCACGGAGCAGAAAATG CCACAT CACCG CAGGAT CCTTTG 


511 


QY 


421 


CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAA 


480 


Db 


512 


MMMIIIMIIIIIMIIIIIIII IIIIMI Mlllll Mlllll III Mill 

CTGCTTGAGCM.CCTGCAAAACATCGAAACACCTACCAAATAACAATAATAAGTCCAATA 


571 


Qy 


481 


T CATTTCAGAGATGGG CATTTCC 503 




Db 


572 


MM II lllllllllll II 

ACATTACAAAGATGGGCATTGCC 594 





RESULT 9 
AI119218 
LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



AI119218 816 bp mRNA linear EST 02-SEP-1998 

ue94h02.yl Sugano mouse embryo mewa Mus musculus cDNA clone 
IMAGE: 1498803 5' similar to gb:X04482 Mouse mRNA for 
preproinsulin-like growth factor IB (MOUSE) ; , mRNA sequence. 
AI119218 

AI119218.1 GI:3519542 
EST. 

Mus musculus {house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 816) 

Marra # M., Hillier,L., Allen, M., Bowles, M., Dietrich, N. , Dubuque,T., 
Geisel,S., Kucaba,T. , Lacy,M. , Le,M. , Mart in, J., Morris,M., 
Schellenberg,K. , Steptoe,M., Tan, F . , Underwood, K. , Moore, B. , 
Theising,B., Wylie, T. , Lennon,G. , Soares,B. /# Wilson, R. and 
Waterston,R. 

The WashU-HHMI Mouse EST Project 
Unpublished 

Contact: Marra M/Mouse EST Project 

WashU-HHMI Mouse EST Project 

Washington University School of MedicineP 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email: mouseest@watson.wustl.edu 

This clone is available royalty- free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
MGI : 936407 

Seq primer: custom primer used 
High quality sequence stop: 4 73. 

Location/Qualifiers 

1. .816 

/organism="Mus musculus" 

/mol__type="mRNA" . N 

/strain="C57BL" 

/db_xref="taxon: 1009 0" 

/ clone= !I IMAGE : 1498803" 

/dev_stage=" embryo, 14 dpc" 

/lab_host="DH10B" 

/clone_lib=" Sugano mouse embryo mewa" 

/note="Vector: pME18S-FL3; Site_l : Drain (CACTGTGTG) ; 
Site_2: Dralll (CACCATGTG) ; 1st strand cDNA was primed 
with an oligo(dT) primer [ATGTGGCCTTTTTTTTTTTTTTTTT] ; 



double-stranded cDNA was ligated to a Drain adaptor 
[TGTTGGCCTACTGG] , digested and cloned into distinct Drain 
sites of the pME18S-FL3 vector (5' site CACTGTGTG, 3' site 
CACCATGTG) . Xhol should be used to isolate the cDNA 
insert- Size selection was performed to exclude fragments 
<1.5kb. Library constructed by Dr. Sumio Sugano 
(University of Tokyo Institute of Medical Science) - 
Custom primers for sequencing: 5' end primer 
CTT CTG CT CTAAAAGCTG CG and 3 ' end primer 
CGACCTGCAGCTCGAGCACA 

BASE COUNT 230 a 219 c 172 g 187 t 8 others 

ORIGIN 

Query Match 78.9%; Score 425.2; DB 9; Length 816; 

Best Local Similarity 90.7%; Pred. No. 3.2e-89; 

Matches 448; Conservative 0; Mismatches 46; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

IIIIIIIIMIIIMMIMIilllllllllllll MUNI MMMMil I 

Db 323 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 382 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCT^ 120 

MMMMIMMMMMMMIMM MM. Ml 

Db 383 AGGGGCTTTTACTTC^CAAGCCC^CAGGCTATGGCTCCAGC^TTCGGAGGGCACCTCAG 442 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

II IIIIIIIMIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIMI III II I I III I I 

Db 443 ACAGG CATTGTGGATGAGTGTTG CTTCCGGAG CTGTGAT CTGAGGAGACTGGAGATGTAC 502 

Qy 181 TGTGTCCGCTG(^GCCTACAAAGT(^GCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 240 

MM II I II Ml Mill II 1 1 II II Mill 1 1 II II MM II MM II 

Db 503 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCC^GCGCCA 562 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACA^ 300 

1 1 ! I ! 1 1 1 1 1 1 1 1 1 1 T 1 1 1 1 1 1 I IIIIIMIMI IIIIMIM II II Ml Mill I 

Db 563 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAAC 622 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACC 3 60 

I I I II I I II I II I I I II I IIIMIIIIIIIIIIMMIIIMIM II II I II III II I 

Db 623 . AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTG CANGAAACAAGACCTA 682 

Qy 3 61 CAGAATGTAGGAGGAG CCTCCCGAGGAACAGAAAATGCCACGTCACCG CAAGAT C CTTTG 42 0 

MIMIIII Ml Mill I II 

Db 683 CAGAATGTANGAGGAG CCTN CCACGGAGCAGAANATG C CACAT CACCG CANGATCCTTTG 742 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAAT^ 480 

IIMIIIIIIMIIIIM II II III Mill II III MM II II III MM 

Db 743 CTGCTTGAGCAACCTGCANAACATCGAAACACCTA 802 

Qy 481 TCATTTCAGAGATG 4 94 

MM I Mill 

Db 8 03 ACATTACCAAGATG 816 



RESULT 10 

AW495481/C 

LOCUS AW4 954 81 



499 bp mRNA linear EST 24-FEB-2000 



DEFINITION UI-M-BH3-auy-g-ll-0-UI .sl NIH_BMAP_M_S4 Mus musculus cDNA clone 

UI-M-BH3-auy-g-ll-0-UI 3', mRNA sequence. 
ACCESSION AW495481 
VERSION AW495481.1 GI: 7065762 

KEYWORDS EST . 

SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
REFERENCE 1 (bases 1 to 499) 

AUTHORS Bonaldo,M.F. , Lennon,G. and Soares, M.B. 

TITLE Normalization and subtraction: two approaches to facilitate gene 
discovery 

JOURNAL Genome Res. 6 (9), 791-806 (1996) 
MEDLINE 97044477 
PUBMED 8889548 
COMMENT Contact: Chin, H 

National Institute of Mental Health 

6001 Executive Blvd. Room 7N-7190, MSC 9643, Bethesda, MD 

20892-9643, USA 

Tel: 301 443 1706 

Fax: 301 443 9890 

Email: mEST@mail.nih.gov 

The sequence contained an oligo-dT track that was present in the 
oligonucleotide that was used to prime the synthesis of first 
strand cDNA and therefore this may represent a bonaf ide poly A 
tail. The sequence tag present in the cDNA between the NotI site 
and the oligo-dT track served to identify it as a clone from the 
normalized pineal glands library cDNA Library Preparation: M.B. 
Soares Lab Clone distribution: Researchers may obtain BMAP cDNA 
clones from RESEARCH GENETICS. It should be noted that Bento Soares 
is generating a small number of additional specialized 
non-redundant arrays of BMAP cDNAs whose availability will be 
considered under appropriate and limited collaborative arrangements 
Seq primer: M13 Forward 
POL YA- Yes . 

FEATURES Location/Qualifiers 
source 1. .4 99 

/organism="Mus musculus" 
/ mo 1 _t yp e = " mRNA " 
/strain="C57BL/6J" 
/db_xref = '"taxon: 10090" 
/clone="UI-M-BH3-auy-g-ll-0-UI " 
/dev_stage="27-32 days" 
/lab_host= H DH10B (Life Technologies)" 
/ c 1 oneJL ib= " NI H_BMAP_M_S 4 " 

/note= "Vector : pT7T3D-Pac (Pharmacia) with a modified 
polyl inker; Site_l: Not I; Site_2: Eco RI ; The 
NIH_BMAP_M_S4 library is a subtracted library of a series, 
ultimately derived from a mixture of individually tagged 
normalized libraries from ten regions of the mouse brain 
(cerebellum, brain stems, olfactory bulbs, hypothalamus, 
cortex, amygdala, basal ganglia, pineal gland, striatum, 
hipoccampus) after a series of subtractions to reduce the 
representation of cDNAs from which ESTs had already been 
generated. The following serially subtracted libraries 
were generated in this process: NIH_BMAP__M_S4 , 



NIH_BMAP_M__S3 .3, NIH_BMAP_M_S3 . 2 , NIH_BMAP_M_S3 . 1 , 
NIH_BMAP_M_S2 , NIH_BMAP_M_S1 . The subtracted library 
(NIH_BMAP_M_S4) was constructed as follows: PCRamplif ied 
cDNA inserts from NIH_BMAP_M_S3 . 3 , NIH_BMAP_M_S3 . 2 , and 
NIH_BMAP_M__S3 . 1 clones from which 3' ESTs had been derived 
was used as a driver in a hybridization with a pool of 
the NIH_BMAP_M_S3 .3, NIH_BMAP_M_S3 . 2 , and NIH_BMAP_M_S3 . 1 
libraries in the form of single- stranded circles. The 
remaining single- stranded circles (subtracted library) 
was purified by hydroxyapatite column chromatography, 
converted to double -stranded circles and electroporated 
into DH10B bacteria (Lif eTechnologies ) to generate the 
NIH_BMAP_M_S4 library. This procedure has been previously 
described (Bonaldo, Lennon and Soares, Genome Research 
6:791-806, 1996) 
TAG_L I B =N I H_BMAP_M_S 4 
TAG_TISSUE=pineal -glands 
TAG_S EQ = CAGAC " 

BASE COUNT 86 a 112 c 124 g 177 t 

ORIGIN 



Query Match 78.5%; Score 423; DB 9; Length 4 99; 

Best Local Similarity 91.8%; Pred. No. 9.5e-89; 

Matches 44 7; Conservative .0; Mismatches 40; Indels 0; Gaps 0; 
Qy 5 0 TGTGTGGACCAAGGGGCTTTTACTTCAACAAGCCCAC 109 

Ml I III II I IMIMIIMIIIIIIIMMIIIIIII IMIMIMIIIIIIIIIM 

Db 4 99 TGTGTGGACCGAGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGA 44 0 

Qy 110 GGGCACCACAGACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGC 169 

lllllll Mill MMIIMMMMMMIMIMMMMMMIMMIMM I 

Db 439 GGGCAC CT CAGACAGG CATTGTGGATGAGTGTTG CTTCCGGAGCTGTGAT CTGAGGAGAC 380 



Qy 17 0 TGGAGATGTACTGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGC 22 9 

I I I I II I II I I I i I I II i I I I I II I I I I I I I I I I I I I I I I I II I I I I I 

Db 379 TGGAGATGTACTGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGC 320 

Qy 23 0 GCCACACTGACATGCCCAAGACTCAGAAGTCCCAGC^ 289 

IIIIIIIIIIIIMIIIIIIIIIMIIIIIIII I 1 1 II I MINI Mill I III II 

Db 319 GCCACACTGACATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGA 260 



Qy 2 90 AGCTG CAAAGGAGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGA 34 9 

M I M I II MM III 1 1 Ml Ml IM I M 1 1 1 1 1 1 1 ! 1 1 1 1 ! M 1 1 1 1 II 1 1 1 1 1 M 

Db 259 AGCTGCAAAGGAGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGA 200 

Qy 350 AACAAGAC CTACAGAATGTAGGAGGAG CCTC CCGAGGAACAGAAAATG CCACGT CACCG C 4 09 

MIIMIMI II I III II III II I Mill MM III MINIM IT lllllll 

Db 199 AACAAGACCTAC^GAATGTAGGAGGAGCCTCCCACGGAGC^GAAAATGCa^CATCACCGC 14 0 

Qy 410 AAGATCCTTTGCTGCTTGAGCAACCTGCAAAAt^TC 469 

I MIMIIIII llllll llllll INN llllllll lllllll lllllll Mill 

Db 13 9 AGGATCCTTTGCTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATA 80 



Qy 47 0 TGAGTTCAATATCATTTCAGAGATGGGCATTTCCCTCAATGAAATACA 52 9 

I III Mill MM II IMIIMIIMMM IMIIMIM IMIMMMMI 

Db 7 9 TAAGTCCAATAACATTACA^GATGGGCATTTCCCCCAATG 20 



536 
13 



RESULT 11 
AI876493/C 

LOCUS AI876493 642 bp mRNA linear EST 21-JUL-1999 

DEFINITION uj59bl0.xl Sugano mouse liver mlia Mus musculus cDNA clone 

IMAGE: 1924219 3' similar to gb : X57025_rnal INSULIN- LIKE GROWTH 
FACTOR IA PRECURSOR (HUMAN); gb:X04482 Mouse mRNA for 
preproinsul in-like growth factor IB (MOUSE);, mRNA sequence. 

ACCESSION AI876493 

VERSION AI876493.1 GI : 5550542 

KEYWORDS EST . 

SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
REFERENCE 1 (bases 1 to 642) 

AUTHORS Marra,M., Hillier,L., Kucaba,T., Mart in, J., Beck,C, Wylie,T., 

Underwood, K. , Steptoe,M. , Theising,B. , Allen, M. f Bowers, Y., Person 
,B., Swaller,T. # Gibbons, M. f Pape,D., Harvey, N. , Schurk,R. , Ritter 
, E., Kohn,S., Shin,T., Jackson, Y . , Cardenas, M., McCann,R., 
Waterston,R. and Wilson, R. 
TITLE The WashU-NCI Mouse EST Project 1999 

JOURNAL Unpubl i shed 
COMMENT Contact: Marra M/WashU-NCI Mouse EST Project 19 99 

Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email: mouseest@watson.wustl.edu 

This clone is available royalty- free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
MGI: 980511 

Seq primer: custom primer used 
High quality sequence stop: 257. 
FEATURES Location/Qualif iers 

source 1. .64 2 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL" 

/db_xref ="taxon: 10090" 

/clone=" IMAGE: 1924219" 

/sex=" female" 

/dev_stage=" adult" 

/lab_host="DH10B" 

/clone_lib=" Sugano mouse liver mlia" 

/note="0rgan: liver; Vector: pME18S-FL3; Site_l : Drain 
(CACTGTGTG) ; Site_2 : Drain (CACCATGTG) ; 1st strand cDNA 
was primed with an oligo(dT) primer 

[ATGTGGCCTTTTTTTTTTTTTTTTT] ; double-stranded cDNA was 
ligated to a Drain adaptor [TGTTGGCCTACTGG] , digested 
and cloned into distinct Drain sites of the pME18S-FL3 
vector (5' site CACTGTGTG, 3' site CACCATGTG). Xhol should 
be used to isolate the cDNA insert. Size selection was 



Qy 53 0 CCCGGAA 

Db 19 CCAAAAA 



BASE COUNT 
ORIGIN 



127 



performed to exclude fragments <1.5kb. Library 
constructed by Dr. Sumio Sugano (University of Tokyo 
Institute of Medical Science) . Custom primers for 
sequencing: 5' end primer CTT CTG CTCTAAAAG CTG CG and 3 ! end 
primer CGACCTGCAGCTCGAGCACA . " 
a 154 c 175 g 185 t 1 others 



Query Match 78.1%; Score 421.2; DB 9; Length 642; 

Best Local Similarity 91.1%; Pred. No. 2.6e-88; 

Matches 458; Conservative 0; Mismatches 44; Indels 1; Gaps 



1; 



Qy 

Db 



2 GACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCAA 61 

lilllll Mill I II II 1 1 1 Ml MM! 1 1 1 lllllllll Ml II Mill! II I 

503 GACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGGTCGTGTGTGGACCGA 444 



Qy 



Db 



62 GGGGCTTTTACTTC^CAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAGA 121 

lllllllll lllllllll I Mill lllllllll lllllll II I III II II MM 

443 GGGGCTTTTTCTTCAACAAGGCCACAGGCTATGGCTCCAGCATTTGGAGGGCACCTCAGA 384 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 



122 CGGGCATTGTGGATGAGTGTTGCTTCCGG-AGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

I I II lllllllll MMMMMMM II III llllllll 

3 83 CAGTCAATGTGGATGAGTGTTGCTTCCGGAAGCTGTGATCTGAGAAGACTGNAGATGTAC 324 
181 TGTGTCCGCTGCAAGCCTACAAAGTC^GCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

MM II lllllllllll MM M M Mill II II 1 1 1 1 1 1 II I II 1 1 1 

323 TGTGCCCCA.CTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 2 64 
241 ATGCCCAAGACTCAGAAGTCC(^GCCCCTATCGAC^CACAAGAAAAGGAAGCTGa^AGG 3 00 

II 1 1 II 1 1 II II II 1 1 II II 1 1 I IMIIMMM lllllllll MIMIMIMM 

2 63 ATG CC CAAGACTCAGAAGTCCCCGTGCCTATCGACAAACAAGAAAACGAAG CTG CAAAGG 2 04 

3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGC^ 360 

MMMMMMM MM I M II II I M II M M II I II II II II I II I II II M II 1 1 

2 03 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 144 
361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

1 1 1 1 1 1 1 1 1 1 1 1 f i 1 1 1 M 1 1 1 III MIMIMIMM llllllll MM 

143 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 84 

421 CTGCTTGAGCAACCTGC!AAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 480 

Illlllllllllllllllllllllll lllllll lllllll lllllll Ml Mill 

83 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAAT^ 24 



Qy 



Db 



481 TCATTTCAGAGATGGGCATTTCC 503 

MM II MIMMMIMM 

23 ACATTACAAAGATGGGCATTTCC 1 



■RESULT 12 

AA963258/C 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 



AA963258 525 bp mRNA linear EST 09-MAR-1999 

UI-R-El-gh-f-04-0-UI . si UI-R-E1 Rattus norvegicus cDNA clone 
UI-R-El-gh-f -04-0-UI 3*, mRNA sequence. 
AA963258 

AA963258.1 GI:4278182 
EST. 



SOURCE Rattus norvegicus (Norway rat) 

ORGANISM Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

REFERENCE 1 (bases 1 to 525) 

AUTHORS Bonaldo, M . F . , Lennon,G. and Soares,M.B. 

TITLE Normalization and subtraction: two approaches to facilitate gene 

discovery 

JOURNAL Genome Res. 6 (9), 791-806 (1996) 
MEDLINE 97044477 
PUBMED 8889548 

COMMENT On May 18, 1998 this sequence version replaced gi:3136750. 

Contact: Soares, MB 

Coordinated Laboratory for Computational Genomics 
University of Iowa 

375 Newton Road , 4156 MEBRF , Iowa City, IA 52242, USA 

Tel: 319 335 8250 

Fax: 319 335 9565 

Email: bento-soares@uiowa . edu 

The sequence tag present in the cDNA between the NotI site and the 
oligo-dT track served to identify it as a clone from the normalized 
adult 12-Day-Embryo library. cDNA Library Preparation: M. Fatima 
Bonaldo, Ph.D. Clone distribution: clones will be available through 
Research Genetics 
Seq primer: M13 Forward. 
FEATURES Location/Qualifiers 
source 1 . . 525 

/organism^" Rattus norvegicus" 

/mol_type=" mRNA " 

/ s t r a i n = " Sp r agu e - Da wl ey " 

/db_xref ="taxon: 10116" 

/clone="UI-R-El-gh-f -04-0-UI" 

/dev_stage= " adult " 

/lab_host="DH10B (Life Technologies)" 
/clone_lib="UI-R-El" 

/note= "Vector : pT7T3D-Pac (Pharmacia) with a modified 
polylinker; Site_l: Not I; Site_2: Eco RI ; The UI-R-E1 
library is a subtracted library derived from the UI-R-EO 
library. The UI-R-EO library consisted of a mixture of 
individually tagged normalized libraries constructed from 
8, 12 and 18 -day embryo. The tag is a string of 3-5 
nucleotides present between the Not I site and the 
oligo-dT track which allows identification of the library 
of origin of a clone within the mixture. The subtracted 
library (UI-R-E1) was constructed as follows: PCR 
amplified cDNA inserts from a pool of UI-R-EO clones from 
which 3 1 ESTs had been derived was used as a driver in a 
hybridization with the UI-R-EO library in the form of 
single-stranded circles. The remaining single-stranded 
circles (subtracted library) was purified by 
hydroxyapatite column chromatography, converted to 
double- stranded circles and electroporated into DH10B 
bacteria (Life Technologies) to generate the UI-R-E1 
library. This procedure has been previously described 
(Bonaldo, Lennon and Soares, Genome Research 6: 791-806, 
1996)" 



BASE COUNT 
ORIGIN 



107 a 133 c 126 g 159 t 



Query Match 74.7%; Score 402.6; DB 9; Length 525; 

Best Local Similarity 88.5%; Pred. No. 5.8e-84; 

Matches 470; Conservative 0; Mismatches 9; Indels 52; Gaps 1; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 521 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 4 62 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTC 120 

Ml I II II I MINI MINIMI 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 

Db 461 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCC^GCATTCGGAGGGCACCACAG 4 02 

Qy 121 ACGGGCATTGTGGATGAGTGTTG CTTC CGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 180 

MIIMMM IMIMIMM IMIMMIM MMIIMM MIMIMII M 

Db 401 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 342 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAG 240 

I I I I I 

Db 341 TGTGCTCCGCTGAAGCCTAC^VAAGTC^GCTCGTTCCATCCGGGCCC^GCGCCACACTGAC 282 

Qy 241 ATG CCCAAGACTCAGAAGTCCCAG C CCCTAT CGACACACAAGAAAAGGAAG CTG CAAAGG 300 

MIMIIIMIIIII 

Db 281 ATGCCCAAGACTCAG 267 

Qy 301 AGAAGGAAAGGAAGTACAGTTGAAGAACACAAGTAGAGGAAGTGCAGGAAAC^^ 360 

I MIIIIIMI MMIMIMI MIMMIM MIIMMM Ml 

Db 266 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 214 

Qy 3 61 CAGAATGTAGGAGGAG C CT C CCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

III IIMIIMIIIIIIIIIMMIIIIIMMIIIII Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 213 CAGAATGTAGGAGGAG CCTCCCGAGGAACAGAAAATGC CACGT CAC CGCAAGAT C CTTTG 154 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACAC 48 0 

1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 i I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 k ^ 

Db 153 CTGCTTGAG CAACCTG CAAAACATCGGAACAC CTG CCAAATAT CAATAATGAGTT CAATA 94 

Qy 481 T(^TTT(^GAGATGGGCATTTCCCTCAATGAAATA(^C^GTAAAC^TTCC 531 

IMIIMI MMMMMMMMMMI Mill II MM Mllillll 

Db 93 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 43 



RESULT 13 
AA945553/C 

LOCUS AA945553 500 bp mRNA linear EST 08-JAN-1999 

DEFINITION EST201052 Normalized rat liver, Bento Soares Rattus sp . cDNA clone 

RLIA083 3' end, mRNA sequence. 
ACCESSION AA94 5553 

VERSION AA945553.1 GI: 4132547 

KEYWORDS . EST. 
SOURCE Rattus sp . 

ORGANISM Rattus sp . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; 

Rattus . 



REFERENCE 1 {bases 1 to 500) 

AUTHORS Lee,N.H., Glodek,A., Chandra, I., Mason, T.M. , Quackenbush, J - , 

Kerlavage,A.R. and Adams , M . D . 
TITLE Rat Genome Project: Generation of a Rat EST (REST) Catalog & Rat 

Gene Index 
JOURNAL Unpublished 
COMMENT On May 1, 1998 this sequence version replaced gi: 3105469. 

Contact: Lee, NH 

The Institute for Genomic Research 

9712, Medical Center Drive, Rockville, MD 20850, USA 
Tel: (301) -838-3529 
Fax: (301) -838-0208 
Email : nhlee@tigr.org 
Seq primer: M13-21. 
FEATURES Location/Qualifiers 
source 1. .500 

/organism="Rattus sp. " 

/mol_type= n mRNA" 

/db_xref ="taxon: 10118" 

/clone-"RLIA083" 

/clone_lib="Normalized rat liver, Bento Soares" 
/note="0rgan: liver; Vector: pT7T3Pac; Site_l: EcoRI ; 
Site_2: Not I" 

BASE COUNT 99 a 129 c 130 g 142 t 

ORIGIN 

Query Match 74.5%; Score 401.6; DB 9; Length 500; 

Best Local Similarity 88.5%; Pred. No. 9.8e-84; 

Matches 4 69; Conservative 0; Mismatches 9; Indels 52; Gaps 1; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

IIIMIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 

Db 4 78 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 419 

Qy 61 AGGGGCTTTTACTTCAACAAGCCGACAGTCTATGGCTCCAGC^ 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 IIIIIIMIIIIIIIIIIIIIIIIIIIMM 

Db 418 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCC^GCATTCGGAGGGCA 359 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMMIIIIIMIMIIIMIIII 

Db 358 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 2 99 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCC^ 24 0 

1 1 1 1 I IIMIIMIIIIIIIIIIIMIIIIMMIIIIIIIIMIMIMIII 

Db 2 98 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 239 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACA.CAAGA 3 00 

I I 1 I I t I 1 I I t 1 I I I 

Db 238 ATGCCCAAGACTCAG --- 224 

Qy 301 AGAAGGAAAGGAAGTACACTTGAAGAA(^C^GTAGAGGAAGTGCAGGAAACAAGACCTA 3 60 

IIMIMlllllllMIIIIIIMIIIIIIIIIIMIIIIIIMIIMMIM 

Db 223 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 171 

Qy 361 CAGAATGTAGGAGGAGCCT C CCGAGGAACAGAAAATGCCACGT CAC CG CAAGATCCTTTG 420 

1 1 1 1 1 1 1 1 1 Ml 1 1 III II 1 1 M 1 1 MM 1 1 II I IN M ! II 1 1 1 1 1 1 II 

Db 17 0 CAGAATGTAGGAGGAGCCTCC CGAGGAACAGAAAATGCCACGTCAC CG CAAGAT CCTTTG 111 



Qy 421 CTGCTTGAGCAACCTGCAAAA.CATCGGAA.CACCTGCay^TATC^TAATGAGTTCAATA 480 

I ; I MM ' i I Ml ! Ml Ml I Ml U ,M I I M 

Db 110 CTGCTTGAGCAACCTGCA^ 51 

Qy 481 T(^TTTC^GAGATGGGCATTTCCCT(^TGAAATACACAAGTAAACATTC 53 0 

1 1 1 1 M 1 1 1 1 1 , 1 M 1 1 1 1 1 1 1 1 1 1 i M I ! 1 1 M 1 1 1 1 1 1 : ! : I 

Db .50 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 1 



RESULT 14 

AI59975l/c 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



AI599751 525 bp mRNA linear EST 21-APR-1999 

EST251454 Normalized rat embryo, Bento Soares Rattus sp. cDNA clone 
REMEG8 6 3' end, mRNA sequence. 
AI599751 

AI599751.1 GI:4608799 
EST . 

Rattus sp. 
Rattus sp. 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

1 (bases 1 to 525) 

Lee,N..H., Glodek,A., Chandra, I., Mason, T.M. # Quackenbush, J. , 
Ker 1 avage , A . R . and Adams, M.D. 

Rat Genome Project: Generation of a Rat EST (REST) Catalog & Rat 
Gene Index 
Unpublished 
Contact: Lee, NH 

The Institute for Genomic Research 

9712, Medical Center Drive, Rockville, MD 20850, USA 
Tel: (301) -838-3529 
Fax: (301) -838-0208 
Email: nhlee@tigr.org 
Seq primer: M13-21. 

Location/Qualifiers 

1. .525 

/organism= "Rattus sp." 
/mol_type=" mRNA " 
/db_xref="taxon: 10118" 
/clone="REMEG86" 

/dev_stage=" embryo 8, 12, 18 dpc" 
/clone__lib=" Normalized rat embryo, Berito Soares" 
/note= n Vector: pT7T3Pac; Site_l : EcoRI ; Site_2: Not I" 
107 a 133 c 140 g 145 t 



Query Match 74.1%; Score 3 99.4; DB 9; Length 525; 

Best Local Similarity 88.1%; Pred. No. 3.2e-83; 

Matches 468; Conservative 0; Mismatches 11; Indels 52; 



Gaps 



1; 



Qy 



Db 



1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill lllllllllllllllllllllllllllllll.il Mil II 1 1 II I II II I II I II 

48 0 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAATTCGTGTGTGGACCA 421 



QY 



61 AGGGGCTTTTACTTCAACAAGCCCA(^GTCTATGGCTCCAGCATTCGGAGGGC^CCACAG 12 0 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 



MM 1 1 1 III I! I II I 1 1 1 II , 1 1 1 1 1 II 1 1 Ml II 1 1 1 1 1 1 II Ml 1 1 1 

42 0 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGC^ 361 
121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

M 1 1 1 M I MM I MM M 1 1 1 II I M 1 1 1 1 II I II I M Ml II I IM 1 1 1 1 II I II * 

360 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 3 01 

181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3 0 0 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACATTGAC 241 

241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 300 

IIIIIIIIIMIIII 

24 0 ATGCCCAAGACTCAG : 22 6 

3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 36 0 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

225 AAGGAAGTACACTTGAAGAACA.CAAGTAGAGGAAGTGCAGGAAACAAGACCTA 173 

361 CAGAATGTAGGAGGAG CCT CCCGAGGAACAGAAAATG CCACGT CAC CG CAAGATC CTTTG 42 0 

Mi 1 1 1 Ml 1 1 Ml I IMI M 1 1 1 1 1 1 1 Ml 1 1 M 1 1 M 1 1 1 1 1 1 Ml 1 1 MM 

172 CAGAATGTAGGAGGAG CCT CCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 113 
421 CTGCTTGAGCAACCTGCAAAACATCGGAACACC^ 48 0 

1 1 1 1 MM IM 1 1 II 1 1 1 IM 1 1 1 1 1 1 III II M M 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 

112 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTt^TA 53 
481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

lllllilll III Mill 1 1 1 1 1 II M M M I II M I 1 1 1 1 1 1 Ml 

52 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 2 



RESULT 15 

BI294072/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



BI294072 499 bp mRNA linear EST 19-JUL-2001 

UI-R-DKO-cej -b-03-0-UI . si UI-R-DK0 Rattus norvegicus cDNA clone 
UI-R-DKO-cej -b-03-0-UI 3', mRNA sequence. 
BI294072 

BI294072.1 GI:14956179 
EST. 

Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

1 (bases 1 to 499) 

Bonaldo , M . F . , Lennon,G. and Soares,M.B. 

Normalization and subtraction: two approaches to facilitate gene 
discovery 

Genome Res. 6 (9), 791-806 (1996) 

97044477 

8889548 

Contact: Soares, MB 

Coordinated Laboratory for Computational Genomics 
University of Iowa 

375 Newton Road , 4156 MEBRF, Iowa City # IA 52242, USA 
Tel: 319 335 8250 



Fax: 319 335 9565 

Email : bento-soares@uiowa . edu 

The sequence contained an oligo-dT track that was present in the 
oligonucleotide that was used to prime the synthesis of first 
strand cDNA and therefore this may represent a bonafide poly A 
tail. The sequence tag present in the cDNA between the NotI site 
and the oligo-dT track served to identify it as a clone from the 
normalized rat heart pool library cDNA Library Preparation: M.B. 
Scares Lab Clone distribution: clones will be available through 
Research Genetics (www.resgen.com) 
Seq primer: M13 Forward 
POLYA-Yes . 

FEATURES Location/Qualifiers 
source 1. .4 99 

/organism="Rattus norvegicus" 
/ mol_type=" mRNA" 
/strain="Sprague-Dawley M 
/db_xref = " t axon : 1 0 1 1 6 " 
/clone="UI-R-DKO-cej -b-03-O-UI" 
/ dev_s t age = " ADULT " 

/lab_host="DH10B (Life Technologies) " 
/clone_lib= M UI-R-DKO n 

/note="Vector : pT7T3D-Pac (Pharmacia) with a modified 
polylinker; Site_l : Not I; Site_2 : Eco RI"; The UI-R-DKO 
library is a subtracted library derived from a mixture of 
five individually tagged normalized rat libraries: 
brain-nRBP (20%), heart-nRHP (20%), kidney-nRKP (20%) , 
aorta-nRAP (20%) , and placenta-nRPP (20%) . Each original 
library was constructed from a mixture of equal amounts of 
RNA from seven different developmental time-points: 
embryonic day 17, embryonic day 19, embryonic day 21, 
adult day 1, adult day 12, adult day 75, and adult day 
200. (Exception: the aorta pool does not contain embryonic 
day 17 RNA and the placenta pool contains only the three 
embryonic stages) . Each library was normalized 
individually according to the procedure described by 
Bonaldo, Lennon & Soares (Genome Research Genome 6: 
791-806, 1996) . For construction of the DK0 subtracted 
library, plasmid DNA from each of the five individually 
tagged normalized libraries was mixed in the proportions 
specified above and electroporated into competent bacteria 
for production of single-stranded circular DNA 
representing the pool of libraries. Single-stranded 
circular DNA representing these five normalized libraries 
was then used as a tracer in a subtractive hybridization 
with a driver (PCR amplified inserts from a plasmid DNA 
template preparation) comprising: a) a set of about 1,000 
arrayed clones from each of the five non-normalized 
libraries of brain (CTOs) , heart (CSOs) , kidney (CUOs) , 
aorta (CWOs) , and placenta (CXOs) . The resulting pool of 
approximately 5,000 clones represented about 33.3% of the 
final driver population. A set of about 2,000 arrayed 
clones from each of the five normalized libraries of brain 
(CT0), heart (CS0) , kidney (CU0) , aorta (CW0) , and 
placenta (CX0) . The resulting pool of about 10,000 clones 
represented about 66.6% of the final driver population. 
TAG LIB=UI-R-DK0 



TAG_TISSUE=rat heart pool 

TAG_S EQ =ATAAGATAAC " 
BASE COUNT 97 a 124 c 121 g 156 t 1 others 

ORIGIN 

\ 

Query Match 73.3%; Score 395; DB 12; Length 499; 

Best Local Similarity 88.0%; Pred. No. 3.4e-82; 

Matches 4 63; Conservative 0; Mismatches 11; Indels 52; Gaps 1; 
Qy 6 AGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCAAGGGG 65 

MINIM II II MUM II II II II M Ml II I II M MM III II II I II II II I 

Db 499 AGAGACCCTNTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCAAGGGG 44 0 

Qy 66 CTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCA 125 

II II 1 1 1 1 1 1 IM 1 1 Ml MM M I M 1 1 1 1 M II I II I II 1 1 1 M 1 1 M II II I II 

Db 43 9 CTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGC^ 38 0 

Qy 12 6 CATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTACTGTGT 185 

Ml M 1 1 1 1 M II 1 1 1 1 1 1 1 M I IM 1 1 1 MM IM 1 1 1 1 1 M MUM IM 1 1 

Db 3 79 CATTGTGGATGAGTGTTGCTTCCGGAGCTGTTATCTGAGGAGGCTGGAGATGTACTGTGC 32 0 

Qy 18 6 CCGCTGC^GCCTACAAAGTCAGCTCGTTCC^TCCGGGCCCAGCGCCAC^CTGAaVTGCC 24 5 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I II I I I 

Db 319 TCCGCTGAAGCCTACAAAGTCAGCTCGTTCCA 260 

Qy 246 CAAGACT CAGAAGT CC CAG CCCCTAT CGACACACAAGAAAAGGAAGCTGCAAAGGAGAAG 305 

MINIUM 

Db 259 CAAGACTCAG 250 

Qy 3 06 GAAAGGAAGTACA CTTGAAGAA CA CAAGTAGAGGAAG TG CAGGAAA CAAGA CCTA CAGAA 365 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 24 9 - - AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAA CAAGACCTA CAGAA 192 

Qy 366 TGTAGGAGGAG C CTC C CGAGGAACAGAAAATG C CACGTCACCG CAAGATCCTTTGCTGCT 425 

I M 1 1 II 1 1 1 1 II I M I M II 1 1 1 1 1 1 II 1 1 M 1 1 M I M II M I II 1 1 1 1 1 1 1 1 1 II I 

Db 191 TGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTGCTGCT 132 

Qy 426 TGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATC^ 485 

I M 1 1 II I M I II 1 1 M 1 1 II 1 1 1 1 1 1 M I M M I M 1 1 1 II M I II 1 1 1 1 1 M 1 1 1 1 

Db 131 TGAGCAACCTGCAAAACATCGGAACACCTGCCA^ 72 

Qy 48 6 TCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

I II 1 1 II 1 1 II II I III I II 1 1 1 1 1 1 1 II I IM 1 1 1 1 1 M II 1 1 

Db 71 TCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 26 



Search completed: December 13, 2003, 07:29:50 
Job time : 1758.93 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM riucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



December 13, 2003, 05:41:20 ; Search time 2408.26 Seconds 

(without alignments) 
9156.102 Million cell updates/sec 

US-09-852-261-3 
539 

1 ggaccagagaccctttgcgg agtaaacattcccggaattc 539 

I DENT I T Y_NUC 

Gapop 10.0 , Gapext 1.0 

2888711 seqs, 20454813386 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



5777422 



Database 



GenEmbl : * 

1 : gb_ba : * 

2 : gb_htg : * 

3 : gb_in : * 

4 : gb_om : * 

5 : gb_ov : * 

6 : gb_pat : * 

7 : gb_ph : * 

8: gb_pl:* 

9 : gb_pr : * 

10: gb_ro:* 

11: gb_sts:* 

12: gb_sy:* 

13: gb_un:* 

14 : gb_vi : * 

1 5 : em_ba : * 

16: em_fun:* 

1 7 : em_hum : * 

18: em_in:* 

1 9 : em_mu : * 

2 0 : em_om : * 

21: em_or:* 

22: em_ov:* 

23 : em_pat : * 

24: em_ph:* 

25: em_pl:* 

2 6 : em_ro : * 

27 : em sts : * 
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trill VI . 
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trill nty riuni : 


1 1 


em nty inv : w 




em nug o cner : 


O — ) 


trill IlUy HILlo - 


34 


em_htg_pln : * 


35 


em_htg_rod : * 


36 


em htg mam: * 


37 


em htg vrt : * 


38 


em sy:* 


39 


em_htgo_hum : * 


40 


em_htgo__mus : * 


41 


em htgo other:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to. the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

o. 
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Match Length DB 
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1 
-L 


j -5 y 


100 


0 


539 


6 


AVI All A A 


aa!4 / /44 bequence 




J J 1/ 


100 


0 


539 


6 


iiAO UU / ol 


aaj5uu/oi sequence 


3 


535.8 


99 


4 


710 


10 


RATTHFTA 


Ml^iAftn Pat- lnqnl in 


4 


516.6 


.95 


8 


798 


10 


RNIGFI2 


X06108 Rat mRNA (c 


5 


516.6 


95 


8 


958 


10 


RNIGFI1 


X06107 Rat mRNA (c 


6 


470 .2 


87 


2 


1536 


10 


BC012409 


BC012409 Mus muscu 


7 


453 .8 


84 


2 


651 


10 


. MM IGF I BR 


X044 82 Mouse mRNA 


8 


425 


78 


8 


487 


6 


AX147752 


AX147752 Sequence 


9 


425 


78 


8 


487 


6 


AX300789 


AX30078 9 Sequence 
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78 


3 
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74 


7 
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74 
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67 


1 
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AX147746 Sequence 


17 


356.8 


66 


2 


523 


6 


AX300783 


AX300783 Sequence 


18 


325.2 


60 


3 


517 


6 


AX147742 


AX147742 Sequence 


19 


325.2 


60 


3 


517 
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9 
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6 
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6 
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9 


HUMIGFI 
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9 
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6 
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6 
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4 
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42 , 


.7 
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6 
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E05279 DNA encodin 
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42 . 
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10 


AF440694 


AF440694 Mus muscu 


44 
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42. 


.4 


836 


10 


CPIGF1 


X52951 Guinea pig 


45 
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42 . 


.4 


1284 


4 


BTILGF1A 


X15726 Bovine mRNA 



ALIGNMENTS 



RESULT 1 
AX147744 

LOCUS AX147744 539 bp DNA linear PAT 31-AUG-2001 

DEFINITION Sequence 3 from Patent WO0136483. 
ACCESSION AX147744 

VERSION AX147744.1 GI : 14346789 

KEYWORDS 

SOURCE Rattus norvegicus (Norway rat) 

ORGANISM Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; 
Rattus . 
REFERENCE 1 

AUTHORS Goldspink,G.R. and Johnson , I . R . 

TITLE Use of the insul in- like-growth factor i isoform mgf for the 

treatment of neurological disorders 
JOURNAL Patent: WO 0136483-A 3 25-MAY-2001; 
University College London (GB) 
FEATURES Location/Qualifiers 
source 1. .539 

/organism^" Rattus norvegicus" 
/mol_type= "genomic DNA" 
/db_xref="taxon: 10116" 
CDS <1. .336 

/not e=" unnamed protein product" 
/ codon_start=l 
/protein_id="CAC41176 . 1" 
/db_xref ="GI : 14346790" 
/db_xref = " REMTREMBL : CAC4 1176" 

/translation- "GPETLCGAELVDALQFVCGPRGFYFNKPTVYGSSIRRAPQTGIV 
DECCFRSCDLRRLEMYCVRCKPTKSARSIRAQRHTDMPKTQKSQPLSTHKKRKLQRRR 
KGSTLEEHK" 

BASE COUNT 161 a 136 c 139 g 103 t 

ORIGIN 

Query Match 100.0%; Score 53 9; DB 6; Length 53 9; 

Best Local Similarity 100.0%; Pred. No. 5.5e-157; 

Matches 539; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 6 0 



Db 



60 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



61 AGGGGCTTTTACTTCAACAAGCCCAOAGTCTATGGCTCCAGCA^ 120 

1 1 II 1 1 1 1 1 1 1 1 1 1 III I! I II I II I III MM 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 Ml M I II I 

61 AGGGGCTTTTACTTCAAC^GCCCACAGTCTATGGCTCCAGCATTCGGAGGGC^CCAaVG 12 0 
121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

I I I M I I 1 1 I II I I I I M I I I I I I I M I I I I I I I I I I II I I I I I I I I I II I II I II I I 

121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 
181 TGTGTCCGCTGC^GCCTAOWVGTCAGCTCGTTCC^TCCGGGCCCAGCGCCACACTGAC 24 0 

1 1 1 1 1 M MM 1 1 1 1 M 1 1 1 1 M I II 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 II Ml 1 1 1 1 1 III 1 1 M 

181 TGTGTCCGCTGCAAGCCTAaw^GTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 
241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAA 3 00 

1 1 1 II MM 1 1 1 1 1 MM 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IM 1 1 1 

241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACAC^CAAGAAAAGGAAGCTGCAAAGG 300 
3 01 AGAAGGAAAGGAAGTAC^CTTGAAGAACACAAGTAGAGGAAGT^ 360 

II M I II MM I II II I II I M I IM 1 1 1 II I II I II I Mil M III II I II 1 1 1 II I 

3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

I I 1 I I I I I I I I I I I I I I M' I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I 

361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

421 CTGCTTGAGCAACCTGCAAAACATCGGAACA 480 

MINI IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII III Mill MM 
421 CTG CTTGAG CAA C CTG CAAAACAT CGGAACACCTGC CAAATAT CAATAATGAGTTCAATA 48 0 

481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCCCGGAATTC 539 

I Ml II 1 1 1 1 1 M I IM II I II M 1 1 MMI II 1 1 1 1 1 1 1 IMM III 1 1 II M 1 1 1 

4 81 TCATTTCAGAGATGGG CATTTCCCTCAATGAAATACACAAGTAAACATT C C CGGAATTC 53 9 



RESULT 2 
AX300781 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



AX300781 539 bp DNA linear PAT 30-NOV-2001 

Sequence 3 from Patent WO0185781. 

AX300781 

AX300781.1 GI:17382062 

Rattus sp. 
Rattus sp. 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; 
Rattus. 
1 

Goldspink,G.D. and Terenghi , G . B . 

Repair of nerve damage 

Patent: WO 0185781-A 3 15 -NOV-2001 ; 

University College London (GB) ; East Grinstead Medical Research 
Trust (GB) 

Location/Qualifiers 

1. ,539 

/organism^" Rattus sp . " 



/mol_type= "genomic DNA" 
/db_xref="taxon: 10118" 
CDS <1. .336 

/not e= "unnamed protein product" 
/codon_start=l 
/protein_id="CAD13041 . 1" 
/db_xref="GI : 17382063" 

/ 1 rans 1 a t i on- " G PETLCGAELVDALQFVCGPRGF YFNKPTVYGS S I RRAPQTG I V 
DECCFRSCDLRRLEMYCVRCKPTKSARSIRAQRHTDMPKTQKSQPLSTHKKRKLQRRR 
KGSTLEEHK" 

BASE COUNT 161 a 13 6 C 13 9 g 103 t 

ORIGIN 

Query Match 100.0%; Score 539; DB 6; Length 539; 

Best Local Similarity 100.0%; Pred. No. 5.5e-157 ; 

Matches 53 9; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

I I I I I 1 I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGC^ 12 0 

II I MM I M 1 1 II I II II M 1 1 M I M IM I II I M 1 1 1 1 1 1 1 1 1 1 MMM I 

Db 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCA^ 12 0 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

1 1 M 1 1 1 II I M 1 1 M 1 1 M Ml M 1 1 Ml 1 1 M 1 1 MM Ml M I M M MM 

Db 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGT(^GCTCGTTCCATCCGGGCC(^GCGC(^CACTGAC 240 

1 1 II 1 1 1 IM II 1 1 M 1 1 II M 1 1 1 1 Ml 1 1 1 1 I M Ml IMM 1 1 1 Ml M 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTOVGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 240 

Qy 241 ATGCCCAAGACTCAGAkGTCCCAGCCCCTATC^^ 3 00 

II M 1 1 1 II I M 1 1 I M I M 1 1 1 1 1 1 1 II M 1 1 1 1 1 Ml M I M 1 1 1 M I M 

Db 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACAC^ 3 00 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACAC^GTAGAGGAAGTGCAGGAAACAA 3 60 

M II I! II Ml I M I M IM 1 1 1 M I M I II M 1 1 1 1 1 Ml MMM 1 1 M I M 

Db 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGC^ 3 60 

Qy 361 (^GAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTC^CCGCAAGATCCTTTG 420 

I MM M 1 1 1 II 1 1 1 1 III M I II II I M II I II : M 1 1 1 1 1 II I M II I M I M 

Db 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTt^CCGC^AGATCCTTTG 42 0 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTC 480 

II II II I Ml M 1 1 IM 1 1 M Ml II II I II II I I M 1 1 II II 1 1 M Mill I 

Db 421 CTGCTTGAGCAACCTGCAAAACATCGGAACA 480 

Qy 481 TCATTTC^GAGATGGGO\TTTCCCTCAATGAAATACAC^GTAAACATTCCCGGAATTC 53 9 

II Ml II M 1 1 M I II II II 1 1 IM M IM M I M 1 1 IMM MMM 1 1 1 1 II 

Db 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACAC^GTAAACATTCCCGGAATTC 53 9 



RESULT 3 
RATIGFIA 
LOCUS 



RATIGFIA 



710 bp mRNA linear ROD 2 7 -APR- 19 93 



DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



FEATURES 

source 



CDS 



sig_peptide 
mat_peptide 
mat_peptide 
mat_peptide 
mat_jpeptide 
mat_peptide 



BASE COUNT 
ORIGIN 



Rat insulin-like growth factor I (IGF- I) mRNA, complete cds . 
M15480 

M15480.1 GI:204749 

growth factor; insulin-like growth factor. 
Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

1* (bases 1 to 710) 

Roberts, C.T. Jr., Lasky,S.R., Lowe,W.L. Jr., Seaman, W.. T. and 
LeRoith,D. 

Molecular cloning of rat insulin-like growth factor I complementary 

deoxyribonucleic acids: differential messenger ribonucleic acid 

processing and regulation by growth hormone in extrahepatic tissues 

Mol. Endocrinol. 1 (3), 243-248 (1987) 

88288198 

3453891 

Original source text: Rat (Sprague-Dawley) adult liver cDNA to 
mRNA, clone pRIGF-1-42 . 

Draft entry and computer -readable copy of sequence in [Mol. 
Endocrinol. (1987) In press] kindly 
provided by S.R.Lasky, 16-MAR-1987. 

Location/Qualifiers 

1. .710 

/organism= "Rattus norvegicus" 
/mol_type= "mRNA" 
/db_xref="taxon: 10116" 
106. .507 

/note=" insulin- like growth factor precursor" 
/codon_start=l 
/protein_id="AAA41385 . 1" 
/db_xref="GI : 204750" 

/ trans la tion= "MSSSHLFYLALCLLTFTSSATAGPETLCGAELVDALQFVCGPRG 
FYFNKPTGYGSS I RRAPQTG I VDECCFRSCDLRRLEMYCVRCKPTKSARS I RAQRHTD 
MPKTQKSQPLSTHKKRKLQRRRKGSTLEEHK" ' 
106. .171 

/not e=" insulin- like growth factor signal peptide" 
172. .258 

/product=" insulin- like growth factor B peptide" 
259. .294 

/product^" insulin- like growth factor C peptide" 
295. .357 

/product =" insulin- like growth factor A peptide" 
358. .381 

/product^" insulin-like growth factor D peptide" 
382. .504 

/product^" insulin- like growth factor E peptide" 
202 a 191 c 172 g 145 t 
1 bp upstream of EcoR'I site. 



Query Match 9 9.4%; Score 535.8; DB 10; Length 710; 

Best Local Similarity 99.6%; Pred. No. 5.8e-156; 

Matches 537; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 



Qy 



1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

1 1 M 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 II 1 1 



Db 



172 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 231 



Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 120 

MIIIIIIIIIIIIIIIIIMIIIIMI lllllllllllllllllllllllllllllll 

Db 232 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCACAG 2 91 

Qy 121 ACGGGCATTGTGGATGAGTGTTG CTT CCGGAG CTGTGAT CTGAGGAGG CTGGAGATGTAC 18 0 

MINIMI II I lilMIIIIMIIMIIIIIIMIIIIIMIIII I Mill MINIM 

Db 2 92 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 351 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

MlMIMI IMIIMMI MIMIIMM MIMIIMM IIMIMMIM 

Db 352 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 411 

Qy 241 ATG CCCAAGACT CAGAAGTCCCAGCC CCTATCGACACACAAGAAAAGGAAG CTG CAAAGG 3 00 

1 1 1 1 1 1 1 i 1 1 MM 1 1 1 M 1 1 1 Ml 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 1! I II I 

Db 412 ATG CCCAAGACT CAGAAGTCCCAGCC CCTATCGACACACAAGAAAAGGAAG CTG CAAA.GG 471 

Qy 3 01 AGAAGGAAAGGAAGTACA,CTTGAAGAACACAAGTAGAGGAAGTG(^GGAAACAAGACCTA 3 60 

M 1 1 1 1 M 1 1 1 Ml 1 1 1 1 M I MM M I II M 1 1 1 1 M 1 1 1 1 1 1 1 M I M 1 1 1 1 

Db 4 72 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 531 

Qy 3 61 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I i II I II I M I I I I I I I I I I I I I I I I I I 

Db 532 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 591 

Qy 421 CTGCTTGAGC^CCTGC^W^CATCGGAA 4 80 

Ml 1 1 1 M I II Ml 1 1 1 1 M 1 1 Ml I M 1 1 M 1 1 1 1 M 1 1 1 1 1 II II 1 1 1 MM 

Db 5 92 CTGCTTGAGCAACCTGCAAAACA.TCGGAACACCTGCCAAATATCAATAATGAGTTCA 651 

Qy 481 TCATTTCAGAGATGGGCA.TTTCCCTCAATGAAATACACAAGTAAACATTCCCGGAATTC 53 9 

I I I I I I I I I I Ml I I I I I I I I I II I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I M 

Db 652 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCCCGGAATTC 710 



RESULT 4 

RNIGFI2 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



RNIGFI2 798 bp mRNA linear ROD 12-SEP-1993 

Rat mRNA (clone IGF1AB2) for insulin-like growth factor I. 

X06108 M32339 Y00429 

X06108.1 GI:56426 

insulin- like growth factor I. 

Rattus norvegicus (Norway rat) 

Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus. 
1 

Shimatsu,A. and Rotwein,P. 

Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing 

Within the 5 1 Untranslated Region 

Nucleic Acids Res. 15 (1987) In press 

2 (bases 1 to 798) 

Rotwein,P. 

Direct Submission 

Submitted (21-OCT-1987) Rotwein P., Washington University, School 
of Medicine, 660 South Euclid Avenue, Box 8127, St. Louis, MO 



COMMENT 

FEATURES 

source 



misc feature 



CDS 



misc feature 



BASE COUNT 
ORIGIN 



63110, USA 

Another IGF-I mRNA of rat liver differing in the 5' UT-region is 
described in <X06107>. 

Location/Qualifiers 

1. .798 

/organism="Rattus norvegicus" 
/ mo 1 _ t yp e = " mRNA " 
/ s t ra in= " Sprague-Dawl ey " 
/db_xref="taxon: 10116" 
/ 1 i s sue_type= " 1 iver " 
1. .72 

/note="5' UT-region" 
73. .552 

/note="IGF-I (AA 1-159) " 
/codon_start=l 
/protein_id="CAA294 81 . 1" 
/db_xref="GI : 56427" 

/ trans la t ion= " MGKI SSLPTQLFKI CLCDFLKI KI HIMSSSHLFYLALCLLTFTS 
SATAGPETLCGAELVDALQFVCGPRGFYFNKPTGYGSS I RRAPQTGI VDECCFRSCDL 
RRLEMYCAPLKPTKSARSIRAQRHTDMPKTQKSQPLSTHKKRKLQRRRKGSTLEEHK" 
553. .798 

/note= n 3' UT-region" 
238 a 196 c 183 g 181 t 



Query Match 95.8%; Score 516.6; DB 10; Length 798; 

Best Local Similarity 98.3%; Pred. No. 5.9e-150; 

Matches 522; Conservative 0; Mismatches 9; Indels 0; Gaps 



0; 



Qy 


1 


GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 


60 


Db 


217 


II M M M 1 M Ml M II Ml M Ml 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 MM 1 M 1 

GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 


276 


Qy 


61 


AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCT^ 

III III II II IIMIIMM MINIM MMIMMIMM I IMIIIMI 

AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGC^TTC^ 


120 


Db 


277 


336 


Qy 


121 


ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 


180 


Db 


337 


II 1 1 II 1 1 1 1 1 1 M II 1 II 1 M II M II II 1 II II II M 1 II M 1 1 1 1 1 1 II 1 1 M 1 M 1 

ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 


396 


Qy 


181 


TGTGTCCGCTGCAAGCCTACAAAGT(^GCTCGTTCCATCCGGGCC(^GCGCCACACTGAC 


240 


Db 


397 


Ml! I II MM 1 II MM 1 MM IMMM II MM Mill MM II Ml III 

TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 


456 


Qy 


241 


ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACAC^ 


300 


Db 


457 


IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIII 

ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 


516 


Qy 


301 


AGAAGGAAAGGAAGTAC^CTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 

III III III IIIIIIIIIMIIII IIIIMIIIIIII Mill II llllllllllllll 

AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 


360 


Db 


517 


576 


Qy 


361 


CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 


420 


Db 


577 


II 1 1 1 1 1 II 1 II II 1 1 II 1 II 1 1 II 1 1 II 1 M 1 II II 1 II II 1 1 1 1 1 M 1 1 II 1 II 1 M 1 

CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 


636 



Qy 421 CTG CTTGAG CAACCTG CAAAACATCGGA^CACCTGCCAAATATCAATAATGAGTTCAA.TA 480 

IMIIIIIIIIIMIIIIIIIIMMIIIMMIMIMMIIMIMIIIIIIIIIIII 

Db 637 CTG CTTGAG CAACCTG CAAAAG^TCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 696 

Qy 481 TCATTT(^GAGATGGGCATTTCCCTC^TGAAATAC^CAAGTAAACATTCC 531 

IIIIIIIIIIIIIMIIMIIIIMIIMIIillllllllllllllllll 

Db 697 C(^TTTCAGAGATGGG(^TTTCCCTCAATGAAATACAC^GTAAACATTCC 747 



RESULT 5 

RNIGFI1 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 

FEATURES 

source 



misc feature 



CDS 



misc feature 



BASE COUNT 



RNIGFI1 958 bp mRNA linear ROD 12-SEP-1993 

Rat mRNA (clone IGF1AB1) for insulin-like growth factor I. 

X06107 M32260 Y00429 

X06107. 1 GI :56424 

insulin-like growth factor I. 

Rattus norvegicus (Norway rat) 

Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus. 
1 

Shimatsu,A. and Rotwein,P. 

Sequence of two rat insulin-like growth factor I mRNAs differing 

within the 5 ' untranslated region 

Nucleic Acids Res. 15 (17), 7196 (1987) 

88015572 

3658684 

2 (bases 1 to 958) 
Rotwein, P . 
Direct Submission 

Submitted (21-OCT-1987) Rotwein P., Washington University, School 
of Medicine, 660 South Euclid Avenue, Box 8127, St. Louis, MO 
63110, USA 

Another IGF-I mRNA of rat liver differing in the 5* UT- region is 
described in <X06108>. . 

Locat ion/Qual i f iers 

1. .958 

/organism^" Rattus norvegicus" 
/mol_type= "mRNA" 
/ s t r a in= " S rpague - Da wl ey " 
/db_xref ="taxon: 10116" 
/tissue_type=" liver" 
1. .258 

/note="5' UT-region" 
259. .738 

/note="IGF-I (AA 1-159)" 
/codon_start=l 
/protein_id="CAA29480 .1" 
/db_xref-"GI : 56425" 
/db_xref ="SWISS-PROT: P08024" 

/ translation="MGKISSLPTQLFKI CLCDFLKIKIHIMSSSHLFYLALCLLTFTS 
SATAGPETLCGAELVDALQFVCGPRGFYFNKPTGYGSS I RRAPQTGI VDECCFRSCDL 
RRLEMYCAPLKPTKSARSIRAQRHTDMPKTQKSQPLSTHKKRKLQRRRKGSTLEEHK" 
736. .958 

/note= n 3' UT-region" 
260 a 246 c 209 g 243 t 



ORIGIN 



Query Match 95.8%; Score 516.6; DB 10; Length 958; 

Best Local Similarity 98.3%; Pred. No. 6.1e-150; 

Matches 522; Conservative 0; Mismatches 9; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

1 1 IIIMIMI I MINIM I Illllllll I Illllllll I MINIMI I Mill MM 

Db 4 03 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 4 62. 



Qy 



Db 



61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 120 

MINIMI MINIM IMIII I 1 1 1 1 1 1 i II I M I ! 1 M I Mil < 1 1 1 1 

463 AGGGGCTTTTACTTCAACAAGCCCAaVGGCTATGGCTCCAGCATTCGGAGGGC^CCAC^G 522 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



121 ACGGG CATTGTGGATGAGTGTTG CTTCCGGAG CTGTGAT CTGAGGAGG CTGGAGATGTAC 18 0 

IIIMINI IIIINN NIMIIII MMMMMNNINN IIIINII 

523 ACGGG CATTGTGGATGAGTGTTGCTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 582 

181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I M I I I I I ! I I I 

583 TGTGCTCCGCTGAAGCCTA(^VAAGTC^GCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 642 

241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACA 3 00 

NINININ INININ Illllllll IIIINN INIIIMI MIMIIII 

643 ATGCCCAAGACTCAGAAGTCCC^GCCCCTATCGACTVCACAAGAAAAGGAAGCTGC^ 702 

3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 360 

NININI Illllllll IINNIN IIIINN INIIIIN INIINN 

703 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 762 
361 OVGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

I I Illllllll I Illllllll I MINI I II Illllllll Ml II III 1 1 1 1 1 III III I 

763 CAGAATGTAGGAGGAGCCTCCCGAGGAA(^GAAAATGCCACGTCACCGCAAGATCCTTTG 822 
421 CTGCTTGAGCAACCTGCAAAACATCGGAAC^ 480 

IIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIII 

823 CTG CTTGAG CAACCTGCAAAACAT CGGAACACCTG CCAAATAT CAATAATGAGTTCAATA 882 

4 81 Tt^TTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

1 1 1 1 1 1 1 1 1 M I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II 

883 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 933 



RESULT 6 
BC012409 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



linear ROD 16-APR-2003 
mRNA (cDNA clone 



BC012409 1536 bp mRNA 

Mus musculus insulin-like growth factor 
MGC : 18617 IMAGE : 4194295 ) ., complete cds . 
BC012409 

BC012409.1 GI:15214568 
MGC . 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 1536) 

Strausberg,R.L. , Feingold, E.A. , Grouse, L.H., Derge, J.G. , 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



Klausner , R . D. 
Altschul,S.F. 
Hopkins, R.F. , 
Diatchenko, L. 
Stapleton,M. , 
Scheetz,T.E. , 
Carninci , P . , 



Collins, F.S. , Wagner, L. , Shenmen , C . M . , Schuler,G.D. , 
, Zeeberg,B., Buetow,K.H., Schaef er , C . F . , Bhat,N.K., 
Jordan, H., Moore ,T., Max,S.I., Wang, J., Hsieh,F., 
, Marusina, K. , Farmer, A. A. , Rubin , G.M., Hong,L., 
Soares,M.B. , Bonaldo,M.F. , Casavant ,T.L. , 
Brownstein, M . J . , Usdin,T.B., Toshiyuki , S . , 
Prange,C. , Raha,S.S. , Loquellano, N.A. , Peters, G.J. , 
Abramson,R.D. , Mullahy, S.J. , Bosak,S.A. , McEwan, P.J. , 
McKernan,K. J. , Malek,J.A., Gunaratne, P y . H . , Richards, S., 
Worley,K.C, Hale,S., Garcia,A.M. , Gay,L.J. # Hulyk,S.W., 
Villalon,D.K. , Muzny, D . M . , Sodergren, E . J . , Lu,X., Gibbs,R.A., 
Fahey,J., Helton, E . , Ketteman/M. , Madan,A. , Rodrigues , S . , 
Sanchez , A. , Whiting, M. , Madan,A. , Young, A. C. , Shevchenko , Y . , 
Bouf fard,G.G. , Blakesley , R . W . , Touchman , J . W . , Green, E.D. , 
Dickson, M. C. , Rodriguez , A . C . , Grimwood, J . , Schmutz # J., Myers ,R.M., 
Butterf ield, Y.S. , Krzywinski , M . I . , Skalska,U., Smailus , D . E . , 
Schnerch,A. , Schein,J.E., Jones, S.J. and Marra,M.A. 
Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 

22388257 

12477932 

2 (bases 1 to 1536) ' 
Strausberg, R. 
Direct Submission 

Submitted (15-AUG-2001) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL : http://mgc.nci.nih.gov 

Contact : MGC help desk 

Ema i 1 : cgapbs - r@ma i 1 . nih . gov 

Tissue Procurement: Jeffrey E. Green, M.D. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Baylor College of Medicine Human Genome 

Sequencing Center 

Center code: BCM-HGSC 

Web site: http://www.hgsc.bcm.tmc.edu/cdna/ 
Contact : amg@bcm . tmc . edu 

Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H. , 
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, 
A.N. , Gibbs, R.A. 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/ LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 24 Row: k Column: 11 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 6754307. 
FEATURES Location/Qualifiers 
source 1. .1536 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="FVB/N" 
/db_xref ="taxon: 10090" 
/clone="MGC: 18617 IMAGE : 4 194295 11 

/tissue_type= "Liver, normal. 5 month old male mouse." 



/clone_lib="NCI_CGAP_Li9" 
/lab_host= fI DH10B" 
/note="Vector:- pCMV-SP0RT6" 
gene 1 . . 1536 

/gene= l, lgfl" 

/db_xref="LocusID: 16000" 
/db_xref="MGI : 96432" 
CDS 28. .429 

/ codon_start=l 

/product =" insulin- like growth factor 1" 
/protein_id="AAH12409. 1" 
/db_xref="GI : 15214569" 
/db_xref ="LocusID: 16000" 

/ trans la tion= "MSSSHLFYLALCLLTFTSSTTAGPETLCGAELVDALQFVCGPRG 
FYFNKPTGYGSS I RRAPQTGI VDECCFRSCDLRRLEMYCAPLKPTKAARS I RAQRHTD 
MPKTQKSPSLSTNKKTKLQRRRKGSTFEEHK" 

BASE COUNT 485 a 324 c 303 g 424 t 

ORIGIN 

Query Match 87.2%; Score 470.2; DB 10; Length 1536; 

Best Local Similarity 92.8%; Pred. No. 2.1e-135; 

Matches 493; Conservative 0; Mismatches 38; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

II Ml 1 1 1 1 1 1 1 1 1 Ml M II I MMI I Ml 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

Db 94 GGACCAGAGAC C CTTTGCGGGG CTGAG CTGGTGGATG CTCTT CAGTT CGTGTGTGGA C CG 153 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTC 12 0 

1 1 M 1 1 1 1 1 1 1 1 1 1 mi i : ii 1 1 1 m 1 1 M 1 1 mi : 1 1 ill m i ii 1 1 mi mi 

Db 154 AGGGGCTTTTACTTCAA(^GCC(^C^GGCTATGGCTC(^GC^TTCGGAGGGCACCTCAG 213 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

II 1 1 1 1 1 M I 1 1 1 1 1 1 1 1 II I Ml II ! II 1 1 1 1 II Ml 1 1 1 II I Mllllllll 

Db 214 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 273 

Qy 181 TGTGTCCGCTGC^GCCTACAAAGTC^GCTC^^ 24 0 

MM II II III I Mill MM M II I Ml I 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

Db 274 TGTGCCCCACTGAAGCCTACTW^GCAGCCCGCTCTATCCGTGCCC^GCGCC^CACTGAC 333 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGAC^ 300 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I I II I II I II 1 1 : I I I I II 1 1 III II I II 

Db 334 ATG C C CAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTG CAAAGG 393 

Qy 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAA 360 

1 1 M 1 1 M 1 1 1 M M I M M 1 1 1 1 It 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

Db 394 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 4 53 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATG CCACGT CACCG CAAGAT C CTTTG 42 0 

II II II M I I M II I M I II I I III MMIMIIMM I I I I I I I I I I [ I I I I I I 

Db 454 C7VGAATGTAGGAGGAGCCTCCC^CGGAGCAGAAAATGCCAC^TCACCGCAGGATCCTTTG 513 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAAC^ 48 0 

IIMIMMMIIMMMMMMI II Mill I I'M Mlllll III Mill 

Db 514 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATAA 573 

Qy 481 TCATTTCAGAGATGGGCATTTCCCT^ 531 

MM II 1 1 1 1 1 J i I i 1 1 1 i 1 1 Mllllllll IMMMMIIMM 



Db 



574 ACATTACAAAGATGGGCATTTCCCCCAATGAAATATACAAGTAAACATTCC 624 



RESULT 7 
MMIGFIBR 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



FEATURES 

source 



misc feature 



CDS 



sig_peptide 
mat_peptide 
misc_feature 
polyA_site 



MMIGFIBR 651 bp mRNA linear ROD 21-MAR-1995 

Mouse mRNA for preproinsul in-like growth factor IB. 

X04482 

X04482.1 GI:51806 

growth factor; insulin-like growth factor IB; preproinsulin-like 
growth factor IB; signal peptide. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 651) 

Bell, G. I., Stempien,M.M. , Fong , N . M . andRall,L.B. 

Sequences of liver cDNAs encoding two different mouse insulin-like 
growth factor I precursors 

Nucleic Acids Res. 14 (20), 7873-7882 (1986) 

87040760 

3774549 

The sequence is identical to the preproIGF-IA sequence (X04480) 
except for the presence of a 52 bp insertion following codon 86 
(position 397 to 448), caused by alternative RNA splicing. The B 
domain of IGF comprises residues 1-29 (position 139-225) , the C 
domain residues 30-41 (position 226-261) , the A domain residues 
42-62 (position 262-324) and the D domain residues 63-70 (position 
325-348) . 

Location/Qualifiers 
1. .651 

/organism="Mus musculus" 
/ mol_type= " mRNA" 
/strain="Balb/c" 
/db_xref ="taxon: 10090" 
/clone="migf 1 -4 " 
/ 1 i s sue_type= " 1 iver " 
43. .45 

/note="pot. translation start" 
73. .474 

/note="prepro IGF-IB (aa -22 to 111)" 
/codon_start=l 
/protein__id="CAA28170. 1" 
/db_xref="GI: 51807" 
/db_xref ="MGI : 96432" 
/db_xref="SWISS-PROT: P05018" 

/translation^" MSSSHLFYLALCLLTFTSSTTAGPETLCGAELVDALQFVCGPRG 
FYFNKPTGYGSSIRRAPQTGIVDECCFRSCDLRRLEMYCAPLKPTKAARSIRAQRHTD 
MPKTQKSPSLSTNKKTKLQRRRKGSTFEEHK" 
73. .138 

/note=" signal peptide (aa -22 to -1) " 
139. .348 

/product =" mature IGF-IB (aa 1-70) " 
349. .471 

/note="COOH-terminal peptide (E domain) (aa 71 to 111) " 
651 

/note="polyA site" 



BASE COUNT 
ORIGIN 



193 a 



185 c 



149 g 



124 t 



Query Match 84 . 2\ 

Best Local Similarity 92.83 
Matches 4 76; Conservative 



Score 453.8; DB 10; Length 651; 
Pred. No. 2.4e-130; 
0; Mismatches 37; Indels 0; Gaps 



0; 



Qy 



Db 



1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

IIIIINIMIMIIIIIIIIIIIIIMIIIIMI I II II I II I II I II III II II 1 1 

13 9 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 198 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



61 AGGGGCTTTTACTTC^CAAGCCCACAGTCTATGGCTCC^GCATTCGGAGGGCACCACAG 12 0 

IIIIIIIIIMIIIMMIIIIMIIII MIMMIMMIMMMMIIMM III 

199 AGGGGCTTTTACTTCAACAAGCCCA.CAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 258 
121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II 1 1 1 1 1 M 1 1 1 1 1 : III ll 1 1 M 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 llllllllllll 

259 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 318 
181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

Mill! Ill II I III II MM II II lllll MINI llllllllllll 

319 TGTGCCCC^CTGAAGCCTACAAAAGC^^ 378 
241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACA 3 00 

IMIIIIIIIIMIIIIIIIII I MINI III MINIM! MIMMMMM 

379 ATGCC(^GACTC^GAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 43 8 
3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 36 0 

MMMMMMMIMI M I Ml M I M I M 1 1 1 II M I 1 1 1 M I M 1 1 1 1 1 1 1 1 I 

43 9 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGA^ 498 

3 61 (^GAATGTAGGAGGAGCCTCCCGAGGAA(^GAAAATGC(^CGTC^CCGauVGATCCTTTG 42 0 

MMIIIMI MMIIMI III III I III Mill MM Mill i II III MM 

4 99 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCA 558 
421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATC^^ 48 0 

1 1 1 1 1 1 : 1 1 1 1 M 1 1 1 ! ! 1 1 : II III II II Mill lllll II III lllll 

559 CTGCTTGAGCAACCTGCAAAACATCGAAACACCT^ 618 
481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAA 513 

MM II IMMMMMIMI llllllll 

619 ACATTACAAAGATGGGCATTTCCCCCAATGAAA 651 



RESULT 8 
AX147752 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 



linear PAT 31-AUG-2001 



AX147752 487 bp DNA 

Sequence 11 from Patent WO0136483. 
AX147752 

AX147752 .1 GI : 14346797 



Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; 
Rattus. 
1 



AUTHORS Goldspink,G.R. and Johnson, I. R. 

TITLE Use of the insulin-like-growth factor i isoform mgf for the 

treatment of neurological disorders 
JOURNAL Patent: WO 0136483-A 11 25-MAY-2001; 
University College London (GB) 
FEATURES Location/Qualifiers 
source 1. .487 

/organism="Rattus norvegicus" 
/mol_type=" genomic DNA" 
/db_xref = 11 1 axon: 10116" 
CDS . <1. .318 

/not e= "unnamed protein product" 

/codon_start=l 

/prot ein__id= " CAC4 1179.1" 

/db_xref="GI : 14346798" 

/ db_xref - "REMTREMBL: CAC4 1179 " 

/trans 1 a t ion= " GPETLCGAELVDALQFVCGPRGFYFNKPTVYGSS I RRAPQTGI V 
DECCFRSCDLRRLEMYCVRCKPTKSARS I RAQRHTDMPKTQKEVHLKNTSRGSAGNKT 
YRM" 

BASE COUNT 13 9 a 123 c 126 g 99 t 

ORIGIN 

Query Match 78.8%; Score 425; DB 6; Length 487; 

Best Local Similarity 90.4%; Pred. No. 2.3e-121; 

Matches 487; Conservative 0; Mismatches 0; Indels 52; Gaps 1; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

IMMIMI IMIII MM MIMIIIII Mill Mill II MIMMIMI Ml 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCT 12 0 

MIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIMIIII 

Db 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 120 

Qy 121 ACGGG CATTGTGGATGAGTGTTG CTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 180 

IIIIIIIIIMIIIMIIMIIIIIIIIIIIIIIIMIIIIIIIIIMIIMIIIIMM 

Db 121 ACGGG CATTGTGGATGAGTGTTG CTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 180 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTT 24 0 

MMMMMMMMMMMMMMMMMMMMMMMMMMIMIMM 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTC^GCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAA 300 

IIIIMIIIIIIIII 

Db 241 ATGCCCAAGACTCAG 255 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAAO^CAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

I I MM I I I I I I I I Ml I I I i M I I II I I I II I I Ml I li I I I II I M I 

Db 256 AAGGAAGTACACTTGAAGAA(^(^GTAGAGGAAGTGCAGGAAAC^AGACCTA 308 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACA.GAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

llllll MIIIIIIIMI IMIIMMI MIMMIMI MIMMIMI III 

Db 3 09 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 368 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATC^ 480 

II II Ml I II III Ml 1 1 I'll I III II Ml 1 1 Mill II I II I II III 1 1 ill II III 1 1 

Db 369 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAA 428 



Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAA.TACACAAGTAAACATTCCCGGAATTC 539 

I l;hl I I I: I I I I M I Ml I I I hi I I I ! 

Db 42 9 TCATTTCAGAGATGGGCATTTCCCT 4 87 



RESULT 9 
AX300789 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



AX300789 487 bp DNA 

Sequence 11 from Patent WO0185781. 
AX300789 

AX30078 9 . 1 GI : 173 82 070 

Rattus sp. 
Rattus sp. 



CDS 



BASE COUNT 
ORIGIN 



linear PAT 3 0 -NOV- 2 001 



Eukaryota; Metazoa; Chordata; Crariiata; Vertebrata; Euteleostomi ; 

ft/T-^ mm "I -i -i « T?-> -i I- Vi d >~ -i -3 • DnH Qn hi 3 • Cpi n rnrrn 3 +* Vi i - Mi i v t H a o - lVh i v i nao • 



JUl; 

East Grinstead Medical Research 



Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; 
Rattus . 
1 

Goldspink,G.D. and Terenghi , G . B . 
Repair of nerve damage 
Patent: WO 0185781-A 11 15-NOV-2001; 
University College London (GB V 
Trust (GB) 

Locat ion/Qua 1 if iers 

1. .487 

/organism= n Rattus sp. " 
/mol__type=" genomic DNA" 
/db_xref ="taxon: 10118" 
<1. .318 

/not e= "unnamed protein product" 
/codon_start=l 
/protein_id= M CAD13 044 . 1" 
/db_xref-"GI : 17382071" 

/ trans la t ion= " GPETLCGAELVDALQFVCGPRGFYFNKPTVYGSS I RRAPQTGI V 
DECCFRSCDLRRLEMYCVRCKPTKSARS I RAQRHTDMPKTQKEVHLKNTSRGSAGNKT 

YRM" 



YRM 

139 a 123 c 126 g 



99 t 



Query Match 78,8%; Score 425; DB 6; Length 487; 

Best Local Similarity 90.4%; Pred. No. 2.3e-121; 

Matches 487; Conservative 0; Mismatches 0; Indels 52; 



Gaps 



1; 



Qy 
Db 



1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

1 1 1 1 MMMi M I 1 1 1 1 1 1 1 1 1 II 1 1 1 Ml M M 1 1 1 1 1 1 1 1 1 1 1 1 II II M 

1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 



Qy 

Db 

Qy 

Db 



61 AGGGGCTTTTACTTCAA(^GCCCACAGTCTATGGCTC(^GCATTCGGAGGGCAC(^CAG 120 

I I II I ll II I I : M I I I I I I II III I I I III I M I Mil II I 1 1 II I II ■ I I I I 

61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGG 120 
121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

IIIIIIIMIIIIMIIIIIIIIMIIIIMIIIIIIIIIilllMIIIIIIIMIIIII 

121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 



Qy 



181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 240 



Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 



. 1 1 II I II 1 1! 1 1 1: 1 1 1 1 1 1 1 1 1 1 M I M ! 1 1 II I M III I i 1 1 1 M I II I 

181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCC^GCGCCAC7VCTGAC 24 0 

241 ATG CCCAAGACT CAGAAGTC C CAG C C CCTATCGACACACAAGAAAAGGAAGCTG CAAAGG 300 

II I II I II Ml II II 

241 ATGCCCAAGACTCAG 255 

3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

! 1 1 1 1 1 II I III I II 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 : 1 M 1 1 II III 1 1 1 

256 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 308 

3 61 CAGAATGTAGGAGGAGCCTC C CGAGGAACAGAAAATG C CACGTCAC CG CAAGATCCTTTG 42 0 

IIIIIIMIIMIIIIIIIMIII.IIMMM llllllll llllllll IIIIMIMIII 

3 09 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCG CAAGATCCTTTG 368 
421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATC 480 

IIIIIIIMM il MINN MINIMI III llllllll IIMIIII MINIMUM 

369 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 428 
481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCCCGGAATTC 539 

III llllllll MIIMIIMIIII IIIIIIIIMIMIIIII I'lllllllilllll 

429 TCATTTCAGAGATGGGC^TTTCCCTCAATGAAATACAC^GTAAACATTCCCGGAATTC 4 87 



RESULT 10 
RATI GF IB 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



FEATURES 

source 



CDS 



RATIGFIB 1346 bp mRNA linear ROD 27-APR-1993 

Rat insulin-like growth factor I (IGF-I) mRNA, complete cds . 
M15481 

M15481. 1 GI :204753 

growth factor; insulin-like growth factor. 
Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

1 {bases 1 to 1346) 

Roberts, C. T. Jr., Lasky,S.R., Lowe,W.L. 'Jr., Seaman, W.T. and 
LeRoith,D. 

Molecular cloning of rat insulin-like growth factor I complementary 

deoxyribonucleic acids: differential messenger ribonucleic acid 

processing and regulation by growth hormone in extrahepatic tissues 

Mol . .Endocrinol . 1 (3), 243-248 (1987) 

88288198 

3453891 

Original source text: Rat (Sprague-Dawley) adult liver, cDNA to 
mRNA, clone pRIGF-1-25. 

Draft entry and computer-readable copy of sequence in [Mol . 
Endocrinol. (1987) In press] kindly 
provided by S.R.Lasky, 16-MAR-1987. 

Location/Qualifiers 

1. .1346 

/organism^" Rattus norvegicus" 
/mol_type= " mRNA" 
/db_xref="taxon: 10116" 
794. .1177 

/note-' 1 insulin-like growth factor precursor" 



/codon_start=l 
/protein_id="AAA41387. 1" 
/db_xref="GI: 204754" 

/translation= n MSSSHLFYLALCLLTFTSSATAGPETLCGAELVDALQFVCGPRG 

FYFNKPTGYGSS I RRAPQTGI VDECCFRSCDLRRLEMYCVRCKPTKSARS I RAQRHTD 

MPKTQKEVHLKNTSRGSAGNKTYRM " 
sig_peptide 794. .859 

/note="insulin-like growth factor signal peptide" 
mat_peptide 860. .946 

/product =" insulin-like growth factor B peptide" 
mat_peptide 947. .982 

/product=" insulin-like growth factor C peptide" 
mat_peptide 983. .1045 

/product =" insulin- like growth factor A peptide" 
mat_peptide 1046. .1069 

/product="insulin-like growth factor D peptide" 
mat_peptide 1070. .1174 

/product-" insulin-like growth factor E peptide" 
BASE COUNT 348 a 344 c 344 g 310 t 

ORIGIN 1 bp upstream of EcoRI site. 

Query Match 78.3%; Score 421.8; DB 10; Length 1346; 

Best Local Similarity 90.0%; Pred. No. 2.8e-120; 

Matches 485; Conservative 0; Mismatches 2; Indels 52; Gaps 1; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

II 1 1 1 MMMM I 1 1 II 1 1 1 1 II II MIM I I II I II I II M I MUM IM 

Db 86 0 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 919 

Qy 61 AGGGGCTTTTACTTCAACAAGCCC^CAGTCTATC 12 0 

llllllllllllllllllllllllllll MMMMMMMMMMMMMIMM 

Db 92 0 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCACAG 979 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

III MM 1 1 1 1 1 1 ■ Ml I II I IMM 1 1 1 Ml I M II IM 1 1 Ml I M 1 1 1 1 II II 

Db 98 0 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 1039 

Qy 181 TGTGTCCGCTGCAAGCCTAOVAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

II 1 1 1 1 1 1 Ml I II I II I 1 1 1 II M MIM I II 1 1 1 II I II 1 1 1 M M 1 1 II 

Db 104 0 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 1099 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTG 300 

MMIMIMI I 

Db 1100 ATGCCCAAGACTCAG 1114 

Qy 301 AGAAGGAAAGGAAGTA(^CTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I 
Db 1115 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 1167 

Qy 3 61 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

1 1 M 1 1 1 M I ! 1 . 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M I ! II ! ! 1 1 1 1 1 

Db 1168 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 1227 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTG 480 

MINIMUM IIIMIIIIIIIIII MINI llllllllllllllll Ml II MINI 

Db 122 8 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTC 1287 



Qy 481 TCATTTCAGAGATGGGaVTTTCCCTt^TGAAATACAO^GTAAACATTCCCGGAATT^ 539 

II 1 1 1 1 1 1 1 M I I li ,11 1 1 1 II I M I M 1 1 1 Mill ! 1 1 . II I II II I M 

Db 1288 CQ^TTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCCCGGAATT^ 1346 



RESULT 11 
RATIGF1A 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



Craniata ; Vertebrata ; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; 



RATIGF1A 1052 bp mRNA linear ROD 2 9 -MAY- 2 002 

Rattus sp. mRNA for insulin- like growth factor I precursor, 
complete cds . 
D00698 

D00698.1 GI:220780 
insulin- like growth factor I. 
Rattus sp . 
Rattus sp. 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodent ia; 
Rattus. 

1 (bases 28 to 1052) 
Shimatsu,A. and Rotwein,P. 

Mosaic evolution of the insulin-like growth factors. Organization, 
sequence, and expression of the rat insulin-like growth factor I 
gene 

J. Biol. Chem. 262 (16), 7894-7900 (1987) 
87222423 
3034909 

2 (bases 1 to 1052) 

Kato,H., Okoshi,A., Miura,Y. and Noguchi,T. 

A new cDNA clone relating to larger molecular species of rat 
insulin-like growth factor-I mRNA 
Agric. Biol. Chem. 54 (6), 1599-1601 (1990) 
91103966 
1368571 

The difference in the size of IGF- I mRNA has been suggested to be 
primarily due to differences in the 3 ' -untranslated region in [1]. 
The cDNA clone to IGF-I mRNA reported in [2] gives direct evidence 
for this hypothesis. 

Location/Qualif iers 
1. .1052 

/organism^" Rattus sp." 
/mol_type="mRNA" 
/db_xref ="taxon: 10118" 
/ 1 i s sue_type= " 1 i ver " 

/note="788 bp upstream of Avail I site." 
43. .426 
/codon_start=l 

/product = 11 insulin-like growth factor I precursor" 
/protein__id="BAA00604 . 1" 
/db_xref="GI : 220781" 

/ 1 rans la t ion= " MSSSHLFYLALCLLTFTSSATAGPETLCGAELVDALQFVCGPRG 
FYFNKPTGYGSSIRRAPQTGIVDECCFRSCDLRRLEMYCAPLKPTKSARSIRAQRHTD 
MPKTQKEVHLKNTSRGSAGNKTYRM " 
109. .318 

/product =" insulin-like growth factor mature peptide" 
606 

/note="g in [2]; deletion in [1]" 
variation 625 



REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



FEATURES 

source 



CDS 



mat_peptide 



variation 



/note="c in [2]; a in [1]" 
variation 645 

/note="a in [2] ; deletion in [1] " 
polyA_signal 710. .715 
polyA_site 731 

/note="polyadenylation site (putative) " 
polyA_site 1023 

/note="polyadenylation site (putative) " 
BASE COUNT 301 a 256 c 225 g 270 t 

ORIGIN 

Query Match 74.7%; Score 402.6; DB 10; Length 1052; 

Best Local Similarity 88.5%; Pred. No. 2.7e-114; 

Matches 470; Conservative 0; Mismatches 9; Indels 52; Gaps 1; 



Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

1 1 M M i 1 1 1 1 1 M 1 1! 1 1 : II i II I II I MM 1 1 I 1 1 1 1 1 1 1 1 1 1! Ill 1 1 III II M 

Db 109 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 168 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATC 12 0 

M 1 1 1 1 1 1 1 1 III 1 1 1 1 1 Ml II I MM 1 1 1 1 1 1 1 1 1 MM Ml 1 1 II 1 1 II 1 1 II 1 1 

Db 169 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGG^ 228 

Qy 121 ACGGG CATTGTGGATGAGTGTTG CTT C CGGAG CTGTGATCTGAGGAGG CTGGAGATGTA C 18 0 

IMMM 1 1 M Ml I Ml I Ml I II I II Mill I II 1 1 1 1 1 1 1 1 1 1 1 MM 1 1 MM i 

Db 22 9 ACGGG CATTGTGGATGAGTGTTGCTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTA C 288 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATC 24 0 

I I I I I I I I I I I II I I I II M I I I I I I I I I I I I MMM I I II I I I I I I II I I 

Db 28 9 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 34 8 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCT 3 00 

MM Mill MM 

Db 34 9 ATGCCCAAGACTCAG 3 63 

Qy 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 360 

M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 Ml 1 1 1 1 II M M II MMM 1 1 II I M I II II 1 1 

Db 364 AAGGAAGTAC^CTTGAAGAACACAAGTAGAGGAAGTGC^GGAAAa^GACCTA 416 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTC 420 

Db 417 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 4 76 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCT 480 

I II 1 1 1 1 1 M 1 1 II Ml I II II I M III II I M II 1 1 Ml II II I M III II I II M I 

Db 477 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCC^^TATC^TAATGA 536 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTC^TGAAATACACAAGTAAACATTCC 531 

MM II 1 1 1 1 1 1 II I Ml M Ml II I II I II II I M 1 1 1 1 1 1 1 1 II MM 

Db 537 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACA.CAAGTAAACATTCC 587 



RESULT 12 
RNMGJIL 

ID RNMGJIL standard; RNA; ROD; 513 BP. 
XX 

AC M17714; 



XX 

SV M17714.1 
XX 

DT 01-OCT-1996 (Rel. 49, Created) 

DT 04-MAR-2000 (Rel. 63, Last updated, Version 2) 
XX 

DE Rat insulin-like growth factor-I mRNA, 3' end. 
XX 

KW . insulin-like growth factor. 
XX 

OS Rattus norvegicus (Norway rat) 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; Mammalia; 

OC Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Rattus. 

XX 

RN [1] 

RP 1-513 

RX MEDLINE; 87246437. 

RX PUBMED; 3595538. 

RA Murphy L.J., Bell G.I., Duckworth M.L., Friesen H.G.; 

RT ' "Identification, characterization, and regulation of a rat complementary 

RT deoxyribonucleic acid which encodes insulin- like growth factor-I"; 

RL Endocrinology 121 (2) : 684-691 (1987) . 
XX 

DR GOA; P08025 . 

DR SWISS-PROT; P08025; IGFA_RAT. 
XX 

FH Key Location/Qualifiers 
FH 

FT source 1. .513 

FT /db_xref= n taxon: 10116" 

FT /organism^" Rattus norvegicus" 

FT mRNA <1. .513 

FT /note="ILGF-I mRNA" 

FT sig_peptide <1. .9 

FT /note="insulin-like growth factor-I signal peptide" 

FT CDS <1. .327 

FT /codon_start=l 

FT /db_xref-"GOA: P08025" 

FT /db_xref= n SWTSS-PROT:P08025 M 

FT /note=" insulin-like growth factor-I precursor" 

FT /protein_id="AAA41227. 1" 

FT /translat ion="ATAGPETLCGAELVDALQFVCGPRGFYFNKPTGYGSSIRRAPQTG 

FT IVDECCFRSCDLRRLEMYCAPLKPTKSARSIRAQRHTDMPKTQKEVHLKNTSRGSAGNK 
FT TYRM " 

FT mat_peptide 10. .324 

FT /note=" insulin-like growth factor-I" 

XX 

SQ Sequence 513 BP; 143 A; 128 C; 135 G; 107 T; 0 other; 

Query Match 74.4%; Score 401; DB 26; Length 513; 

Best Local Similarity 88.3%; Pred . No. 7.5e-114; 

Matches 4 69; Conservative 0; Mismatches 10; Indels 52; Gaps 1; 

Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

I I I I I I I I II I I I I I I IM I ll I ill II ill I I I I I I I I I II I I I INN I I I II I I 

Db 10 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 69 



Qy 61 AGGGGCTTTTACTTCAACAAGCC 12 0 

MM 1 1 1 II II 1 1 1 1 1 1 M I MM I MMMIMIMM MIMIMMIMM 

Db 70 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCC^^ 12 9 

Qy 121 ACGGG CATTGTGGATGAGTGTTG CTTCCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

lllllll MMMIMM MMMMMMMMMIMM IIMIIIIIIMMI 

Db 13 0 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 189 



Qy 181 TGTGTCCGCTGC^GCCTACTWIGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

1 1 1 1 I 1 1 ! 1 1 1 ' I 1 1 II 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II M I 1 1 

Db 190 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 9 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACA 300 

I I Ml II I MM III 

Db 25 0 ATGCCCAAGACTCAG 264 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAAC^CAAGTAGAGGAA 360 

II MIMIIIMI IIMMIMI MIMIIMM MMMIMM I 

Db 265 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 317 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

1 1 II I II 1 1 1 1 M I II 1 1 1 1 1 M II I II I II I M 1 1 1 1 1 II II M 1 1 1 1 1 II II 1 1 M 1 1 

Db 318 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 377 

Qy 421 CTGCTTGAGCAACCTGCAAAAC^TCGGAAC^ 480 

IIIIIMM I MMM II 1 1 ! I M Ml I II I M M II II 1 1 1! M M I II I M II 

Db 378 CTGCTTGAGGAACCTGCAAAACATCGGAACACCTC 437 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

IMIMI MMMIMM MMMMMMMMMIMM MM 

Db 438 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATA(^ 488 

RESULT 13 
RNIGFIR 

LOCUS RNIGFIR 521 bp mRNA linear ROD 3 1 -MAR-2003 

DEFINITION Rattus norvegicus mRNA for insulin-like, growth factorl (IGF-1) . 
ACCESSION X06043 M17714 
VERSION X06043.1 GI : 56434 

KEYWORDS IGF-1; insulin-like growth factor I. 
SOURCE Rattus norvegicus (Norway rat) 

ORGANISM Rattus norvegicus 

Eukaryota; Metazoa; Chorda ta; Crania ta; Vert eb rata; Euteleostomi ; 

Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 

Rattus. 

REFERENCE 1 (bases 1 to 521) 

AUTHORS Murphy, L. J., Bell, G.I.', Duckworth, M . L . and Friesen,H.G. 
TITLE Identification, characterization, and regulation of a rat 

complementary deoxyribonucleic acid which encodes insulin- like 
growth factor- I 
JOURNAL Endocrinology 121 (2), 684-691 (1987) 
MEDLINE 87246437 
PUBMED . 3595538 

COMMENT On Apr 1, 2003 this sequence version replaced gi: 204324. 

Data kindly reviewed (ll-FEB-88) by Murphy L . J . 
FEATURES Location/Qualifiers 



source 1. .521 



/organism^" Rat tus norvegicus" 
/mol_type="mRNA" 
/db_xref="taxon: 10116" 
CDS <1. .327 

/note=" (AA -3 to 105) " 

/codon_start=l 

/product^" IGF-IA precursor" 

/prot ein__id= " CAA2 94 3 6 . 1 " 

/db_xref="GI : 56435" 

/db_xref="GOA:P08025" 

/db_xref="SWISS-PROT: P08025" 

/translation^ "ATAGPETLCGAELVDALQFVCGPRGFYFNKPTGYGSSIRRAPQT 

GIVDECCFRSCDLRRLEMYCAPLKPTKSARSIRAQRHTDMPKTQKEVHLKNTSRGSAG 

NKTYRM" 
sig_peptide <1. .9 

/note=" insulin-like growth factor-I signal peptide" 
matjpeptide 10. .324 

/product^" insulin-like growth. factor-I " 
misc_feature 10. .96 ' . 

/note="B domain (AA 1-29)" 
misc_feature 97. .132 

/note="C domain (AA 30-41)" 
misc_feature 133. .195 

/note="A domain (AA 42-62)" 
misc_f eature 196. .219 

/note="D domain (AA 63-70)" 
BASE COUNT 151 a 128 c 135 g 107 t 

ORIGIN 

Query Match 74.4%; Score 401; DB 10; Length 521; 

Best Local Similarity 88.3%; Pred. No. 7.5e-114; 

Matches 469; Conservative 0; Mismatches 10; Indels 52; Gaps 1; 

Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 10 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 69 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTC 12 0 

I I Mill I Mill Mill I Mill MM I Mill I Mill Mill I II III III II I II 

Db 70 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTC 12 9 

Qy 121 ACGGG CATTGTGGATGAGTGTTG CTT CCGGAGCTGTGAT CTGAGGAGG CTGGAGATGTAC 180 

1 1 1! I M iM 1 1 1 II M 1 1 ! 1 1 II 1 1 I Ml 1 1 1 1 1 II 1 1 1 M II 1 1 M 1 1 1 1 1 II 

Db 13 0 ACGGG CATTGTGGATGAGTGTTG CTT CCGGAG CTGTGAT CTGAGGAGG CTGGAGATGTAC 18 9 

Qy 181 TGTGTCCGCTG(^GCCTAOWlGT(^G 24 0 

MM I I Mill I Mill Mill I Mill I III II Mill I Mill III II I M 

Db 190 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 9 

Qy .241 ATGCCCAAGACTCAGAAGTCCC^GCCCCTATCGACAC^ 300 

lllllllllllllll 

Db 250 ATGCCCAAGACTCAG 264 

Qy 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAG 360 

Mill IMIIMIIIIIIIINIMIMIIIIIIMII Mill llllll 

Db 265 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 317 



Qy 
Db 

Qy 
Db 

Qy 
Db 



361 CAGAATGTAGGAGGAGC CTC CCGAGGAACAGAAAATG CCACGTCAC CGGAAGATC CTTTG 42 0 

Mi MINI MINIMI I Mlllilll MM MINIMUM III lllllllll 

318 CAGAATGTAGGAGGAGC CT CCCGAGGAACAGAAAATG CCACGTCACCGCAAGATCCTTTG 377 
421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 4 8 0 

lllllllll lllllllll I Ml lllllllll 1 1 IIIMI Mill Ml II lllllllll 

378 CTGCTTGAGGAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 437 
481 TCATTTCAGAGATGGGC^TTTCCCTC^TGAAATAC^(^AGTAAACATTCC 531 

, 1 1 1 II I Ml 1 1 1 1 1 : 1 1 1 II 1 1 1 1 Im 1 , 1 1 1 1 1 1 II 1 1 1 1 1 1 

43 8 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 488 



RESULT 14 
RATIGFIAA 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



RATIGFIAA 826 bp mRNA linear ROD 27-APR-1993 

Rat insulin-like growth factor I (IGF-I) mRNA, complete cds . 
M17335 

M17335.1 GI:204751 

insulin- like growth factor I. - 
Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Chordata; Craniata; Vertebrata; Euteleostomi ; 
Rodent ia; Sciurognathi ; Muridae; Murinae; 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 

FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



Eukaryota ; Metazoa ; 
Mamma 1 i a ; Eut her i a ; 
Rattus. 

1 (bases 1 to 826) 

Casella , S . J . , Smith, E. P., van Wyk, J.J. , Joseph, D.R. , Hynes,M.A., 
Hoy t , E . C . and Lund , P . K . 

Isolation of rat testis cDNAs encoding an insulin-like growth 

factor I precursor 

DNA 6 (4) , 325-330 (1987) 

88003970 

3652906 

Original source text: Rat (Sprague Dawley) testis, cDNA to mRNA, 
clone P2 . 

Location/Qualif iers 
1. .826 

/organism^" Rattus norvegicus" 
/mol_type="mRNA" 
/db_xref-"taxon: 10116" 
259. .642 

/note=" insulin like growth factor I" - 
/codon_start=l 
/protein_id-"AAA41386 .1" 
/db_xref-"GI : 204752" 

/ translat ion="MSSSHLFYLALCLLTFTSSATAGPETLCGAELVDALQFVCGPRG 
FYFNKPTGYGSS I RRAPQTGI VDECCFRSCDLRRLEMYCAPLKPTKSARS I RAQRHTD 
MPKTQKEVHLKNTSRGSAGNKTYRM" 
223 a 214 c 181 g 208 t 



Query Match 71.8%; Score 387; DB 10; Length 82 6; 

Best Local Similarity 88.0%; Pred. No. 1.9e-109; 

Matches 455; Conservative 0; Mismatches 10; Indels 52; 



Gaps 



1; 



Qy 



1 GGACCAGAGACC CTTTG CGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

1 1 1 1 1 1 II 1 1 1 1 i I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 1 1 IIIMI 



Db 



325 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAATTCGTGTGTGGACCA 384 



Qy 61 AGGGGCTTTTACTTCAAC^GCCCA.CAGTCTATGGCTC^GCATTCGGAGGGCACCACAG 120 

II 1 1 Mill II Ml MM Mill II III IIIMMIIMM M MIIIIIIMIII 

Db 385 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCACAG 444 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Ml ] I I II ill I Ml I I I I I ll I I I I I II I I Ml I II I ; II III II 1 1 1 1 

Db 445 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 5 04 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

II 1 1 I ■ M M I II 1 1 1 Ml M I Ml 1 1 1 II M M ■ M 1 1 1 1 1 1 1 II II 

Db 505 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 564 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 300 

MIIMIMI 

Db 565 ATGCCCAAGACTCAG 579 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

1 1 1 1 Ml I Ml M 1 1 II I II Ml M I II 1 1 1 M I II II I M I Ml 

Db 580 AAGGAAGTACACTTGAAGAAC^CAAGTAGAGGAAGTGCAGGAAACAAGACCTA 632 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

IMIIIIIIII I lllllllllllllllll MIIIIIMIIII I Mill Mill lllllll 

Db 633 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 692 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACA.CCTGCCAAATATCAATAATGAGTTCAATA 48 0 

III II MM II lllllill IMMMMMIMI Illlllll IMMMIMMIMM 

Db 693 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 752 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACA 517 

Ml 1 1 IM M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II I M 

Db 753 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACA 78 9 



RESULT 15 

MMIGFIAR 

LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



MMIGFIAR 696 bp mRNA linear .ROD 21-MAR-1995 

Mouse mRNA for preproinsul in-like growth factor IA. 

X04480 

X04480.1 GI:51801 

growth factor; insulin-like growth factor IA; preproinsul in- like 
growth factor IA; signal peptide. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 696) 

Bell,G.I., Stempien,M.M. , Fong , N . M . andRall,L.B. 

Sequences of liver cDNAs encoding two different mouse insulin-like 
growth factor I precursors 

Nucleic Acids Res. 14 (20), 7873-7882 (1986) 

87040760 

3774549 

The B-domain of IGF-I comprises residues 1-29 (position 219-305) , 
the C-domain residues 30-41 (position 306-341) , the A-domain 
residues 42-62 (position 342-404) and the D-domain residues 63-70 



FEATURES 

source 



misc feature 



CDS 



sig__peptide 
mat_peptide 
misc_f eature 
polyA_site 



BASE COUNT 
ORIGIN 



(position 405-428) . 

Location/Qualif iers 
1. .696 

/organism="Mus musculus" 
/ mo 1 _ t yp e = 11 mRNA " 
/strain="Balb/c" 
/db_xref="taxon: 10090" 
/clone="migf 1-2" 
/ 1 i s sue_type= " 1 i ver " 
123. .125 

/note="pot. translation start" 
153. .536 

/note="put. preproIGF-IA (AA -22 to 105)" 
/codon_start=l 
/protein_id="CAA2 8168 .1" 
/db_xref ="GI : 51802" 
/db_xref ="MGI : 96432" 
/db_xref="SWISS-PROT:P05017" 

/trans lation="MSSSHLFYLALCLLTFTSSTTAGPETLCGAELVDALQFVCGPRG 
FYFNKPTGYGSSIRRAPQTGIVDECCFRSCDLRRLEMYCAPLKPTKAARSIRAQRHTD 
M P KTQ KE VHLKNTS RG S AGN KT YRM " 
153. .218 

/note="signal peptide (AA -22 to -1)" 
219. .428 

/product = "mature IGF-IA (AA 1-70)" 
429. .533 

/note="COOH- terminal peptide (E domain) (AA 71 to 105)" 

696 

/note="polyA site" 
201 a 200 c 148 g 147 t 



Query Match 67.1%; Score 361.6; DB 10; Length 696; 

Best Local Similarity 83.8%; Pred. No. 1.6e-101; 

Matches 444; Conservative 0; Mismatches 34; Indels 52; Gaps 



1; 



Qy 



Db 



1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

I Mill IIIIIIIIMIIIII MINIMI INN II 1 1 II II I II 1 1 1 1 1 1 1 1 II II 

219 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 278 



Qy 



Db 



61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCT 12 0 

MMMM 1 1 1 111 1 1 1 1 II Ml I Mill lllllllllllll Mill III 

279 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAG(^TTCGGAGGGCACCTCAG 338 



Qy 



Db 



121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

M 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 Ml MMMM III I Ml 1 1 Ml I MMMM II I 

33 9 ACAGG CATTGTGGATGAGTGTTG CTTCCGGAG CTGTGAT CTGAGGAGACTGGAGATGTAC 3 98 



Qy 

Db 

Qy 
Db 



181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCC 24 0 

MINI 1 1 1 1 1 1 1 . 1 MM II II Mill II Mill I MM I II II I 

399 TGTGCCCCACTGAAGCCTAOW^GCAGCCCGCT^ 4 58 

241 ATGCCCAAGACTCAGAAGTC CCAG C CCCTATCGACACACAAGAAAAGGAAG CTG CAAAGG 3 00 

MM MMMM 

459 ATGCCCAAGACTCAG 4 73 



Qy 



301 AGAAGGAAAGGAAGTAOVCTTGAAGAAC^CAAGTAGAGGAAGTGC^GGAAAC^GACCTA 3 60 



Db 4 74 AAGGAAGTACATTTGAAGAAC^a\AGTAGAGGAAGTGCAGGAAACAAGACCTA 526 

Qy 361 CAGAATGTAGGAGGAG CCT CCCGAGGAACAGAAAATG CCACGTCAC CGCAAGATCCTTTG 420 

MMIMIIMM IMIII III I i I 1 1 1 M 1 1 1 Mill 1 1 f 1 1 1 1 1 1 

Db 527 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 586 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTC 480 

1 1 1 II M 1 1 1 1 1 1 I II I II 1 1 1 M III I Mill MM 1 1 1 Mill 

Db 587 CTGCTTGAGCT^CCTGCAAAACATCGAAAC^CCTACCAAATAACAATAATAAGTCCAATA 646 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 530 

1 1 1 1 II 1 1 Mill II I II I II II Ml II III MM Mill I MM 

Db 647 ACATTACAAAGATGGG CATTTC CC C CAATGAAATATACAAGTAAAGATT C 696 



Search completed: December 13, 2003, 09:27:34 
Job time : 2410.26 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2 003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched : 



December 13, 2003, 02:35:18 ; Search time 216.419 Seconds 

(without alignments) 
6723.048 Million cell updates/sec 

US-09-852-261-3 
539 

1 ggaccagagaccctttgcgg agtaaacattcccggaattc 53 9 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

2552756 seqs., 1349719017 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -process ing : Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
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Pred. No. is the number of results predicted by chance to have a 



score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
AAD06399 

ID AAD06399 standard; cDNA ; 539 BP. 
XX 

AC AAD06399; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Rat IGF-I isoform mechano -growth factor (MGF) cDNA. 
XX 

KW Rat; IGF-I isoform; Insulin-like Growth Factor-I; MGF; 

KW mechano -growth factor; neurological disorder; neurodegenerative disorder; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post-polio syndrome; toxin; motoneurone disorder; 

KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy ; 

KW sex -linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; ss. 

XX 

OS Rattus sp. 



XX 

FH Key Location/Qualifiers 

FT CDS 1..336 
FT /*tag= a 

FT /product^ "Mechano -growth factor (MGF) " 

FT /note= "This region comprises exons 3-6. The CDS does 

FT not include start codon" 

FT /partial 

XX 



PN WO200136483-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 15-NOV-2000; 2000WO-GB04354 . 
XX 

PR 15-NOV-1999; 99GB-0026968 . 
XX 

PA (UNLO ) UNIV COLLEGE LONDON. 
XX 

PI Goldspink G, Johnson I; 
XX 

DR WPI; 2001-355620/37. 

DR P-PSDB; AAE02448. 
XX 

PT Use of mechano -growth factor, an isoform of Insulin-like Growth 

PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 

PS Claim 4; Page 51-52; 66pp; English. 
XX 

CC The present invention relates to use of mechano -growth factor (MGF) , 

CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 2 0% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 



CC including a disorder of motoneurones and/or neurodegenerative disorder, 

CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post-polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones, motoneurone loss associated with aging, 

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease. 

CC The present sequence is rat IGF- I isoform MGF cDNA. MGF is a muscle 

CC isoform having extracellular (Ec) domain, hence also referred as 

CC IGF-I-Ec. The MGF protein comprises amino acid sequences encoded by 

CC nucleic acid sequence of IGF-I exons 4, 5 and 6 in the reading frame 

CC of MGF. 



XX 

SQ Sequence 539 BP; 161 A; 136 C; 139 G; 103 T; 0 other; 

Query Match 100.0%; Score 539; DB 22; Length 539; 

Best Local Similarity 100.0%; Pred. No. 1.9e-142; 

Matches 539; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

I M, I I I I I I I I I I I I Ml Ml ! I Ml I Ml I 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 6 0 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCAC^GTCTATGGCTC 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 AGGGGCTTTTACTTCAACAA.GCCCACAGTCTATGGCTCCAGCATTCGGAGGGCAC 120 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

i i i i. i u i i i i u i 'i i i i i i. ri M i 

Db 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMMIIIIIIIIIIIMIMI 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGC^ 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAG 3 00 

IIIIIIMII 1 1 1 1 MM II II I! I MINIMI II IIIIIIIIIII I IIIIIIIIIII I 

Db 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACA 3 00 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 3 60 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 ! I i I 

Db 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 3 60 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTC 42 0 

I Mill IMM.I I IIIIIIIIIII I IMMMMMMIIMIMM 1 1 IIIIIIIIIII 

Db 3 61 CAGAATGTAGGAGGAGC CTC CCGAGGAACAGAAAATG C CACGTCACCG CAAGATCCTTTG 42 0 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGG 480 

I I I I I ;l I I I I I Ml I I I I I I I M I III I I 

Db 421 CTG CTTGAGCAAC CTG CAAAACATCGGAACACCTG CCAAATATCAATAATGAGTTCAATA 48 0 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCCCGGAATTC 53 9 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATAC^ 53 9 



RESULT 2 
AAS16878 

ID AAS16878 standard; cDNA; 539 BP. 
XX 

AC AAS16878; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Rat mechano -growth factor (MGF) cDNA. 
XX 

KW Rat; mechano -growth factor; insulin- like growth factor I; IGF- I; MGF; 

KW neuroprotective; nerve damage; peripheral nervous system; nerve severing ; 

KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss; 

KW ' nerve avulsion. 

XX 

OS Rattus sp. 



XX 

FH Key Locat ion/Qualif iers 

FT CDS 1. .336 

FT /*tag= a 

FT /product = "Rat MGF" 

FT /partial 

FT /note= "No start codon" 

FT exon 1 . . 75 

FT /*tag= b 

FT /number= exon 3 

FT exon 76 . .258 

FT /*tag= c 

FT /number= exon 4 

FT exon 259 . .309 

FT /*tag= d 

FT / number^ exon 5 

FT exon. 310. .333 

FT /*tag= e 

FT /number =■ exon 6 

XX 



PN WO200185781-A2 . 
XX 

PD 15-NOV-2001. 
XX 

PF 10-MAY-2001; 2 00 1WO-GB02 054 . 
XX 

PR 10-MAY-2000; 2 00 OGB-0 0 11278 . 
XX 

PA (UNLO ) UNIV COLLEGE LONDON . 

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10560 . 
XX 

PT Use of insulin-like growth factor I (IGF-I) isoform known as 

PT mechano-growth factor which is encoded by IGF-I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 



PS Disclosure; Fig 6; 65pp; English. 
XX 

CC The invention relates to the use of an insulin-like growth factor I 

CC (IGF-I) i so form, known as mechano -growth factor (MGF) , in the manufacture 

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration. The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 

CC methods can reduce motoneuron loss by 2 0% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the rat MGF. 

XX 

SQ Sequence 539 BP; 161 A; 136 C; 139 G; 103 T; 0 other; 

Query Match 100.0%; Score 539; DB 24; Length 539; 

Best Local Similarity 100.0%; Pred. No. 1.9e-142; 

Matches 539; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r i 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 6 0 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCC^ 12 0 

I MMMM MIMIMI IMIMIII lllllllll MIIIIIIMIMMI 

Db 61 AGGGGCTTTTACTTCAACAAGCCCAC^GTCTATC^ 120 

Qy 121 ACGGGCATTGTGGATGAGTGTTG CTT CCGGAG CTGTGAT CTGAGGAGG CTGGAGATGTAC 180 

I 1 1 II 1 1 1 1 1 1 1 1 1 ! II 1 1 :! Ml 1 1 II , I ! 1 1 1 II II 1 1 1 1! 1 1 1 M I i I M 

Db 121 ACGGG CATTGTGGATGAGTGTTG CTT CCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCG 24 0 

IIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIII 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

Qy 241 ATG C CCAAGACTCAGAAGTCCCAG C CC CTAT CGACACACAAGAAAAGGAAG CTGCAAAGG 3 00 

III MMMIIMI III Mill 1 1 IIIIMII I MM Mill I Mill Mill Illllll 

Db 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACA 300 

Qy 3 01 AGAAGGAAAGGAAGTAOVCTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

II MIIMIII MMMM IMMIII MMIIMI MIIIIIIMIMMI 

Db 3 01 AGAAGGAAAGGAAGTACACTTGAAGAA^C^GTAGAGGAAGTGCAGGAAACAAGACCTA 3 60 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

III I M 1 1 1 1 1 M I IIMIIMI I IIIIMII MINIMI I Mill MM I Illllll 

Db 3 61 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCC 480 

III I II II I Mill III Mill 1 1 MMMM Mllllllll I lllllllll IIIIMII 

Db 421 CTG CTTGAGCAACCTG CAAAACAT CGGAACAC CTGC CAAATAT CAATAATGAGTTCAATA 480 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATAC^ 53 9 

IIMIIMI Mill III Mill 1 1 IMMIII Mllllllll I Mill Mill llllll 

Db 481 TCATTTCAGAGATGGGC^TTTCCCTCAATGAAATACACAAGTAAACATTCCCGGAATTC 53 9 



RESULT 3 
ABV76185 

ID ABV76185 standard; cDNA; 651 BP. 
XX 

AC ABV76185; 
XX 

DT 07-MAR-2003 (first entry) 
XX 

DE Mouse insulin- like growth factor IB cDNA. 
XX i 

KW Insulin-like growth factor IB; IGF-IB; mouse; mRNA; assay; 

KW nucleic acid detection; gene; ss. 

XX 

OS Mus mus cuius. 
XX 

FH Key Location/Qualif iers 

FT CDS 73 . .474 

FT /*tag= a 

FT /product = "IGF-IB" 

XX 

PN WO200297390-A2 . 
XX 

PD 05-DEC-2002. 
XX 

PF 31-MAY-2002; 2002WO-SE01056 . 
XX 

PR 01-JUN-2001; 2001SE- 0001934 . 
XX 

PA (BIOV-) BIOVITRUM AB . 
XX 

PI Parrow V, Rosengren L; 
XX 

DR WPI; 2003-129529/12. 
XX 

PT Quantitating a target nucleic acid in a sample comprises immobilizing, 

PT on a solid support, a sample comprising a target nucleic acid, and 

PT detecting and quantitating signals generated from the antisense and 

PT sense probes - 
XX 

PS Example 1; Page 16-17; 18pp; English. 
XX 

CC The present sequence is that of cDNA encoding murine insulin-like 

CC growth factor IB (IGF-IB) . The cDNA was used in an example of the 

CC method of the invention to generate probes for determination of 

CC IGF-IB RNA. The method comprises a quantitative hybridisation 

CC assay for analysis of mRNA in a target nucleic acid (TNA) sample/ 

CC It involves: (i) immobilising the TNA sample on a solid support; 

CC (ii) contacting a labelled antisense probe to a first portion of the 

CC TNA, and a labelled sense probe to a second portion of the TNA; 

CC (iii) detecting and quantitating the signals generated from the 
CC 7 hybridised probes; and (iv) determining the value represented by 

CC the antisense probe signal minus the sense probe signal, the value 

CC being proportional to the amount of mRNA in the TNA sample. In an 

CC example of the method, a cDNA clone containing . 60 nucleotides from 

CC exon 2 and 179 nucleotides from exon 3 of the mouse IGF-IB gene was 



CC cloned into pGEN-4Z vector. Linearisation of the plasmid with 

CC EcoRI allowed transcription of a 250-nucleotide antisense probe 

CC using T7 polymerase. Linearisation with Hindi I I allowed 

CC transcription of a sense probe of similar length using SPG 

CC polymerase (see ABV76186) . The probes were purified and used to 

CC determine IGF-I RNA in mouse hepatocytes and also in rat hepatocytes . 

XX 

SQ Sequence 651 BP; 193 A; 185 C; 149 G; 124 T; 0 other; 



Query Match 84.2%; 
Best Local Similarity 92.8%; 
Matches 476; Conservative 



Score 453.8; DB 25; Length 651; 
Pred. No. 2.5e-118; 
0; Mismatches 37; Indels 0; 



Gaps 



0; 



Qy 



Db 



1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

lllllll MINIM lllllll I Mill lllllll IIMM II Mill Mill I III I 

13 9 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 198 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy. 
Db 

Qy 

Db 



61 AGGGGCTTTTACTTCAACAAGCCCACAG 12 0 

lllllll I III MM 1 1 II Ml I Mill IMIMIMI I lllllll M lllllll III 

199 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 258 
121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II 1 1 1 1 IM II 1 : 1 1 II M I I III I ! I 1 1 1 1 MM 1 1 1 II IMMMIMM 

2 5 9 ACAGG CATTGTGGATGAGTGTTG CTTCCGGAGCTGTGAT CTGAGGAGACTGGAGATGTAC 318 
181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTC^ 24 0 

MINI Mill Mill I MM II II Mill IIIIIIIIIIIIMIIII 

319 TGTGCCCCACTGAAGCCTAOUUUVG(^ 3 78 

241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGAC^ 3 00 

IIIIIMIIIIMIIIIIMM I 1 1 1 1 1 1 1 1 1 1 1 II I III I II lllllllllllll 

379 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACA^ 438 

3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 3 60 

lllllll MM I III Ml I Mill 1 1 1 11 1 1 11 II 1 1 1 1 1 III II IMIMIMM 

439 AGAAGGAAAGGAAGTAC^TTTGAAGAACAC^GTAGAGGAAGTGCAGGAAAa^GACCTA 4 98 
361 CAGAATGTAGGAGGAG CCTCCCGAGGAACAGAAAATG C CACGTCACCGCAAGAT C CTTTG 42 0 

II IMMMIMM II I MM I III IIIIIIMMIII II Ml Ml II II III II 

4 99 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 558 

421 CTGCTTGAGCAACCTGCAAAACATCGGAACA 4 8 0 

II M I I II II II II I I II II I I I I II IMIMI lllllll lllllll III Mill 
559 CTGCTTGAGCAACCTGCAAAACATCGAAACACCT^ 618 

481 TCATTTCAGAGATGGGCATTTCC.CTCAATGAAA 513 

MM II 1 1 1 M lilllll I 11:1111 

619 ACATTACAAAGATGGGCATTTCCCCCAATGAAA 651 



RESULT 4 
AAD06404 

ID AAD06404 standard; cDNA; 487 BP. 
XX 

AC AAD064 04; 
XX 

DT 10-AUG-2001 (first entry) 



XX 

DE Rat liver-type IGF-I isoform (L.IGF-I) cDNA. 
XX 

KW Rat; IGF-I isoform; Insulin-like Growth Factor-I; MGF; 

KW mechano -growth factor; neurological disorder; neurodegenerative disorder; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post -polio syndrome; toxin; motoneurone disorder ; 

KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy; 

KW sex -linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; liver; L.IGF-I; ss. 

XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 

FT CDS 1. .318 

FT /*tag= a 

FT /product- "Liver-type IGF-I isoform . (L.IGF-I ) n 

FT /transl_except= (pos : 304 . . 306, aa:Tyr) 

FT /note= "The translation exception occurs while 

FT decoding the alternative version of the protein 

FT (AAE02531) . The CDS comprises exons 3, 4 and 6 and does 

FT not include start codon" 

FT /partial 

XX 

PN WO200136483-A1. 
XX 

PD 25-MAY-2001 . 
XX 

PF 15-NOV-2000; 2000WO-GB04354 . 
XX 

PR 15-NOV-1999; 99GB-0026968 . 
XX 

PA (UNLO ) UN IV COLLEGE LONDON. 
XX 

PI Goldspink G, Johnson I; 
XX 

DR WPI; 2001-355620/37. 

DR P-PSDB; AAE02451, AAE02531. 

XX 

PT Use of mechano-growth factor, an isoform of Insulin-like Growth 

PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 

PS Disclosure; Page 57-58; 66pp; English. 
XX 

CC The present invention relates to use of mechano-growth factor (MGF) , 

CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 20% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 

CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post-polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 



CC injury that affects motoneurones, motoneurone loss associated with aging, 

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease. 

CC The present sequence is rat liver-type IGF-I isoform (L.IGF-I) cDNA. 

CC The L.IGF-I protein comprises amino acid sequences encoded by 

CC nucleic acid sequence of IGF-I exons 4 and 6. 

XX 

SQ Sequence 487 BP; 139 A; 123 C; 126 G; 99 T; 0 other; 

Query Match 78.8%; Score 425; DB 22; Length 487; 

Best Local Similarity 90.4%; Pred. No. 3.1e-110; 

Matches 4 87; Conservative 0; Mismatches 0; Indels 52; Gaps 1; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

1 1 1 1 1 1 M 1 1 1 M M 1 1 1 II I ! I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 II 1 1 1 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATC 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGC 120 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

IIIIIIIIIM II III IMM I MIMMIMM I IMMMIMI III MIMIMIM 

Db 121 ACGGGCATTGTGGATGAGTGTTG CTTCCGGAG CTGTGAT CTGAGGAGG CTGGAGATGTAC 180 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGT^ 240 

MIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMI 
Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAA 3 00 

IIIIIIMIIIMM 

Db 241 ATGCCCAAGACTCAG 255 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 3 60 

III IMMMMII IMIIIIIIII IIIIIIIIIM IIIIIIIIIM 

Db. 256 - - AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 3 08 

Qy 3 61 CAGAATGTAGGAGGAGCCTC CCGAGGAACAGAAAATG C CACGTCAC CG CAAGATCCTTTG 42 0 

IIIIIIIIIM II III Mill I II I Ml MM I II I MM Ml I II II II I III II Ml I 

Db 309 CAGAATGTAGGAGGAGCCT C CCGAGGAACAGAAAATG C CACGTCAC CG CAAGATCCTTTG 3 68 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACA 4 80 

I I ^ I I I I I I I I I I I I I I E I I I I I I I t I I I I I I I I i I I I I I I I - I I I I I I I I I I I I 

Db 3 69 CTGCTTGAGCAACCTGCAAAACATCGGAACAC 428 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATAC 53 9 

Illlllll MMMIIMI MMIMIMI MIMMIMI MMMIMI 

Db 42 9 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCCCG 4 87 

RESULT 5 
AAS16883 

ID AAS16883 standard; cDNA; 487 BP: 
XX 

AC AAS16883; 
XX 

DT 25-FEB-2002 (first entry) 



XX 

DE Rat insulin-like growth factor I liver-type isoform (L.IGF-I) cDNA . 
XX 

KW Rat; mechano -growth factor; insulin-like growth factor I; IGF-I; MGF; 

KW neuroprotective; nerve damage; peripheral nervous system; nerve severing; 

KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss; 

KW nerve avulsion; insulin-like growth factor I liver-type isoform; L.IGF-I; 
XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..318 

FT /*tag= a 

FT /product^ "Rat L.IGF-I" 

FT /partial 

FT /note= "No start codon" 

FT exon 1 . . 75 

FT /*tag= b 

FT /number= exon 3 

FT exon 76 . . 258 

FT /*tag= c 

FT /number= exon 4 

FT exon 259. .315 

FT /*tag= d 

FT /number= exon 6 

XX 

PN WO200185781-A2 . 
XX 

PD 15-NOV-2001. 
XX 

PF 10-MAY-2001; 2 00 1WO-GB02 054 . 
XX 

PR 10-MAY-2000; 2000GB-0011278 . 
XX 

PA (UNLO ) UN IV COLLEGE LONDON. 

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10563. 
XX 

PT Use of insulin-like growth factor I (IGF-I) isoform known as 

PT mechano -growth factor which is encoded by IGF-I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 

PS Disclosure; Fig 9; 65pp; English. 
XX 

CC The invention relates to the use of an insulin-like growth factor I 

CC (IGF-I) isoform, known as mechano -growth factor (MGF) , in the manufacture 

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment {such as a polypeptide growth 

CC factor other than MGF,) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 



CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration. The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 

CC methods can reduce motoneuron loss by 2 0% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the rat insulin-like 

CC growth factor I liver-type isoform (L.IGF-I) used in experiments on 

CC motoneuron loss. 

XX 

SQ Sequence 487 BP; 139 A; 123 C; 126 G; 99 T; 0 other; 



Query Match 78 . 8< 

Best Local Similarity 90.4' 
Matches 487; Conservative 



Score 425; DB 24; Length 487; 
Pred. No. 3.1e-110; 
0; Mismatches 0; Indels 52; 



Gaps 



1; 



QY 



Db 



1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

I MINI MINI IMIMIIIIMII MINI MINI llilll Mill 

1 GGACCAGAGAC C CTTTG CGGGG CTGAG CTGGTGGACG CT CTT CAGTT CGTGTGTGGACCA 60 



Qy 



Db 



61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 120 

I I I N I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I II I I I II 

61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATC 120 



Qy 

Db 

Qy 

Db 



121 ACGGG CATTGTGGATGAGTGTTGCTT CCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

181 TGTGTCCGCTGCAAGCCTACAAAGT»GCTCGTTCC^TCCGGGCCCAGCGC(^CACTGAC 24 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 M I M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 

181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 



Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGAaVCA(^GAAAAGGAAGCTGCAAAGG 3 00 

IIUIMII II! 

Db 241 ATGCCCAAGACTCAG 255 

Qy 3 01 AGAAGGAAAGGAAGTAC^CTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

IIIIIIIIIIIMIIIIIIIIIIIIIIMMIIIIMIIIIIIIIIIIIIIII 

Db 256 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 3 08 



Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 309 CAGAATGTAGGAGGAGCCTCCCGAGGAAC^GAAAATGCCACGTCACCGCAAGATCCTTTG 368 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCT^ 48 0 

II IIIIIIIMIIIill MINIMI III II MINIM NIMH llllll MINIM 

Db 3 69 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAA 428 



Qy 481 Tt^TTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCCCGGAATTC 53 9 

II I llllll llllll II Mill I III I II I llllll I MINI I MINI llllll 

Db 42 9 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCCCGGAATTC 4 87 



RESULT 6 
AAX27498 

ID AAX27498 standard; DNA; 1052 BP. 
XX 

AC AAX27498; 
XX 



DT 22-JUN-1999 (first entry) 
XX 

DE Rat liver form of IGF-1. 
XX 

KW Liver; isoform; rat; insulin-like growth factor; IGF1; vertebrate; 

KW muscle mass; human; gravity; cosmetic body sculpting; glucose clearance; 

KW diabetic; ss. 

XX 

OS Rattus sp. 
XX 

PN WO9910013-A1 . 
XX 

PD 04-MAR-1999. 
XX 

PF 25-AUG-1998; 98WO-US17428 . 
XX 

PR 25-AUG-1997; 97US- 00572 01 . 
XX 

PA (MASS- ) MASSACHUSETTS GEN HOSPITAL. 

PA (UYPE-) UNIV PENNSYLVANIA . 

XX 

PI Rosenthal NA, Sweeney HL; 
XX 

DR WPI; 1999-190469/16. 
XX 

PT Use of modified insulin-like growth factor I - in gene therapy for 

PT increasing vertebrate muscle. strength and mass 

XX 

PS Example 1; Fig 4; 46pp; English. 
XX 

CC This sequence represents the liver isoform of the rat insulin-like 

CC growth factor 1 (IGF1) from rats. IGF-1 or a modified or biologically 

CC active portion can be used for increasing vertebrate muscle mass by 

CC intramuscular administration, for e.g. (i) preserving or enhancing 

CC muscle strength in aging humans; (ii) healing injured muscle more 

CC efficiently/rapidly; (iii) controlling muscle mass during disease and/or 

CC prolonged stays in reduced gravity; (iv) cosmetic body sculpting; and 

CC (v) promoting glucose clearance from diabetic muscle tissue. 

XX 

SQ Sequence 1052 BP; 301 A; 256 C; 225 G; 270 T; 0 other; 

Query Match 74.7%; Score 4 02.6; DB 20; Length 1052; 
Best Local Similarity 88.5%; Pred. No. 8.9e-104; 

Matches 4 70; Conservative 0; Mismatches 9; Indels 52; Gaps 1; 

Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill I MINI MINI I Mill I Mill 1 1 1 1 II 1 1 Mill I II II 1 1 MINI I III 

Db 10 9 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 168 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCC^ 12 0 

I II I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I II II II I I I I 

Db 16 9 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCA^ 228 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

IMIIMIIMIIIIIMIIIIIIIIIMIIIMIIIIIIIIIIMMIIIMIIIIMI 

Db 22 9 ACGGG CATTGTGGATGAGTGTTG CTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 2 88 



Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

MM I MMMMMMMMMMMMMMMMMMMMMMMM 

Db 289 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 348 

Qy 241 ATGCCC^GACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 3 00 

1 1 [ M 1 1 1 1 1 1 N 1 1 

Db 34 9 ATGCCCAAGACTCAG 363 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACAC^GTAGAGGAAGTGCAGGAAAa^GACCTA 360 

I M I : M : 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 ! I I I I I I M I I I I I I I M I I I I I I I I 

Db 364 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 416 

Qy 361 (^GAATGTAGGAGGAGCCTCCCGAGGAAC^GAAAATGCCACGTCACCGCAAGATCCTTTG 420 

M M M M M M M M M M M M M M M M M M M M M M M M M M M M M M 

Db 417 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 476 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAAC^ 4 80 

MMMMMM MMMMMMMMMMMMMMMMMMMMMMMM 

Db 4 77 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 536 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

MMMMM MMMMMMMMMMMMM MMMMMM M 

Db 537 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 587 

RESULT 7 
AAD06400 

ID AAD06400 standard; cDNA; 523 BP. 
XX 

AC AAD06400; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Rabbit IGF-I isoform mechano -growth factor (MGF) cDNA . 
XX 

KW Rabbit; IGF-I isoform; Insulin-like Growth Factor-I; MGF; 

KW mechano -growth factor; neurological disorder; neurodegenerative disorder ; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post -polio syndrome; toxin; motoneurone disorder; 

KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy ; 

KW sex-linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; ss. 

XX 

OS Oryctolagus cuniculus . 
XX 

FH Key Location/Qualifiers 
FT CDS 1 . .336 

FT /*tag= a 

FT /product= "Mechano -growth factor (MGF) " 

FT /note= "This region comprises exons 3-6. The CDS does 

FT not include start codon" 

FT /partial 

XX 

PN WO200136483-A1. 
XX 

PD 25-MAY-2001. 
XX 



PF 15-NOV-2000; 2 000WO-GB043 54 . 
XX 

PR 15-NOV-1999; 99GB-0026968 . 
XX 

PA (UNLO ) UNIV COLLEGE LONDON. 
XX 

PI Goldspink G # Johnson I; 
XX 

DR WPI; 2001-355620/37. 

DR P-PSDB; AAE02449. 
XX 

PT Use of me chano -growth factor, an isoform of Insulin- like Growth 

PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 

PS Claim 4; Page 53-54; 66pp ; English. 
XX 

CC The present invention relates to use of mechano-growth factor (MGF) , 

CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 2 0% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 

CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post -polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones, motoneurone loss associated with aging, 

CC autosomal or sex- linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease. 

CC The present sequence is rabbit IGF-I isoform MGF cDNA. MGF is a muscle 

CC isoform having extracellular (Ec) domain, hence also referred as 

CC IGF-I -Ec. The MGF protein comprises amino acid sequences encoded by 

CC nucleic acid sequence of IGF-I exons 4, 5 and 6 in the reading frame 

CC of MGF . 

XX 

SQ Sequence 523 BP; 154 A; 129 C; 142 G; 98 T; 0 other; 

Query Match 66.2%; Score 356.8; DB 22; Length 523; 

Best Local Similarity 82.3%; Pred. No. 6.3e-91; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II Mill MIMMMIMM I MINI MIIIIIMII 

Db 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTC 120 

1 1 II I M II 1 1 1 1 1 1 1 1 1 1 II I II II I -ill'!!;!,; 1 

Db 61 AGGGGCTTTTATTTC^(^GCC(^CAGGATACGGCTCCAGCAGTCGGAGGG(^CCT(^G 12 0 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill MMIMIMI MMIMMMMIMMMMMMMIMMIIMM 

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 



MM II 1 1 i 1 1 1 MMI MM M Ml I M I 1 1 1 II II I II 1 1 II Ml 

Db 181 TGTGCACCCCTCAAGCCGGCAAA^ 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTCCC^GCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 300 

IMMIIIMIIIIIIIII Mill I III II Illllllll MM II III 

Db 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCM 3 00 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

Ml IIMIIII III II I III MIMI I Mill II Ml MIMIMI I Ml 

Db 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

Qy 3 61 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

III IIIIIMI II III I Mill I II I II I IIIIIII II MIMI 

Db 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTC^TA 48 0 

II I 1 1 1 1 1 1 IIIIIII 1 1 1 1 II III MM II 

Db 421 CTGTGCACAGTTACCTGTAAACATTGGAATACCGGCCA - AAAAATAAGTTTGATC 474 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 53 0 

MM III MM MIMIMI I II I II I II I II II II II I II II II I 

Db 475 ACATTTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 



RESULT 8 
AAS16879 

ID AAS16879 standard; cDNA; 523 BP. 
XX 

AC AAS16879; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Rabbit mechano -growth factor (MGF) cDNA. 
XX 

KW Rabbit; mechano -growth factor; insulin- like growth factor I; IGF- I; MGF; 

KW neuroprotective; nerve damage; peripheral nervous system; nerve severing; 

KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss; 

KW nerve avulsion. 
XX 

OS Oryctolagus cuniculus. 
XX 

FH Key Location/Qualif iers 

FT CDS 1..336 

FT /*tag= a 

FT /product = "Rabbit MGF" 

FT /partial 

FT /note= "No start codon" 

FT exon 1 . . 76 

FT /*tag= b 

FT /number= 3 

FT exon 77. .259 

FT /*tag= c 

FT /number = 4 

FT exon 260 . .309 

FT /*tag= d 

FT /number^ 5 

FT exon 311. .333 



FT /*tag= e 

FT /number= 6 

XX 

PN WO200185781-A2 . 
XX 

PD 15-NOV-2001. 
XX 

PF 10-MAY-2001; 2 00 1WO-GB02 054 . 
XX 

PR 10-MAY-2000; 2000GB-0011278 . 
XX 

PA (UNLO ) UN IV COLLEGE LONDON. 

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10561. 
XX 

PT Use of insulin-like growth factor I (IGF-I) isoform known as 

PT mechano-growth factor which is encoded by IGF-I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 

PS Disclosure; Fig 7; 65pp; English. 
XX 

CC The invention relates to the use of an insulin- like growth factor I 

CC (IGF-I) isoform, known as mechano-growth factor (MGF) , in the manufacture 

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration. The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 

CC methods can reduce motoneuron loss by 2 0% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the rabbit MGF. 

XX 

SQ Sequence 523 BP; 154 A; 129 C; 142 G; 98 T; 0 other; 



Query Match 66.2%; Score 356.8; DB 24; Length 523; 

Best Local Similarity 82.3%; Pred. No. 6.3e-91; 

Matches 43 6; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 



Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGtGTGTGGACCA 60 

Mill lllll II Mill I f 1 1 1 1 1 1 1 1 1 1 1 J MMMMMIMIIIIMM 

Db 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCC^ 12 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 i 1 1 1 1 II II I Ml II II III 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 



Qy 121 ACGGG CATTGTGGATGAGTGTTG CTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 18 0 

II Mill MM II I III I Ml 1 1 1 IN 1 , 1 1 ! 1 1 III II I III II ,1 1 II I II i 

Db 121 ACAGG CATCGTGGATGAGTG CTGCTTCCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 180 



Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

I II I II I I I I I I Mill MM II III MM llllllllllllll III 

Db 181 TGTGC^CCCCTCAAGCCGGC^yyVGGCAGCCCGCTCCGTCCGTGCCCAGCGCC^CACCGAC 24 0 

Qy 241 ATG CCCAAGACTCAGAAGTC C CAG C CCCTAT CGACACACAAGAAAAGGAAG CTG CAAAGG 3 00 

MIMIIIIIIIIIIIMI Mill I III II lllllllll MM II III 

Db 241 ATG C CCAAGACTCAGAAGTATCAG C CTCCAT CTACCAACAAGAAAATGAAGTCTCAGAGG 3 00 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGC^GGAAACAAGACCTA 36 0 

I I III II II Ml II MM 1 1 1 1 i I Mill 1 1 1 1 1 1 1 1 1 II Mill I Mill I III 

Db 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAG7WUVTGCCACGTCACCGCAAGATCCTTTG 42 0 

Ml II II MM II III I Mill 

Db 361 CAGGATGTAGGAAGAC CCTT CTGAGGAGTGAAGAAGGACAGG CCACCG CAGGACCCTTTG 42 0 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAA(^CCTGCCAAATATC^TAATGAGTTCAATA 48 0 

1 1 I llllll MM III MM lllllllll II 

Db 421 CTCTGCACAGTTACCTGTAAACATTGGAATAGCGGCCA AAAAATAAGTTTGATC 474 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 53 0 

lllllllll Mill III MMMMMMMMMMMMI 

Db 475 ACATTTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 



RESULT 9 
AAT84893 

ID AAT84893 standard; cDNA ; 553 BP. 
XX 

AC AAT84893; 
XX 

DT 14-APR-1998 (first entry) 
XX 

DE Rabbit insulin like growth factor 1 encoding cDNA . 
XX 

KW Insulin like growth factor 1; IGF-1; Ec peptide; muscle disorder ; 

KW heart; neuromuscular disease; primer; ss . 

XX 

OS Oryctolagus cuniculus . 
XX 

FH Key Location/Qualifiers 

FT CDS 1 . .366 

FT /*tag= a 

FT /product = "IGF-1 " 

XX 

PN W09733997-A1. 
XX 

PD 18-SEP-1997. 
XX 

PF ll-MAR-1997; 97WO-GB00658 . 
XX 

PR ll-MAR-1996; 96GB- 0005124 . 
XX 

PA (UNLO ) ROYAL FREE HOSPITAL SCHOOL MED . 
XX 

PI Goldspink G; 



XX 

DR WPI; 1997-470877/43. 

DR P-PSDB; AAW23301. 
XX 

PT Use of insulin like growth factor I characterised by presence. of Ec 

PT peptide - to treat humans or animals, particularly muscle disorders, 

PT heart conditions or neuromuscular diseases 
XX 

PS Disclosure; Fig 3; 33pp; English. 
XX 

CC A use of insulin like growth factor I (IGF-1) has been developed, and 

CC is characterised by the presence of the Ec peptide, or a functional 

CC equivalent, in the treatment or therapy of a human or animal. The IGF-1 

CC polypeptide can be used to treat muscular disorders, e.g. Duchenne or 

CC Becker muscular dystrophy, autosomal dystrophies and related progressive 

CC skeletal muscle weakness and wasting, muscle atrophy in ageing humans, 

CC spinal cord injury induced muscle atrophy and neuromuscular diseases, 

CC and cardiac disorders, e.g. diseases where promotion of cardiac muscle 

CC protein synthesis is a beneficial treatment, cardiomyopathies and acute 

CC heart failure or insult, specifically myocarditis or myocardial 

CC infarction. It can also be used to promote bone fracture healing and 

CC maintenance of bone in old age. The present sequence encodes rabbit 

CC IGF-1 used in the present specification. 

XX 

SQ Sequence 553 BP; 159 A; 142 C; 147 G; 105 T; 0 other; 

Query Match 66.2%; Score 356.8; DB 18; Length 553; 

Best Local Similarity 82.3%; Pred. No. 6.4e-91; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

INN lllll II lllll Mllllllllllll Mill III Ml lllll Mill 

Db 31 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 90 

QY 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATC^ 120 

MMIMMM I III II I Mill III I II II Mill I II II I Mill MM III 

Db 91 AGGGGCTTTTATTTCAAC^GCCCACAGGATACGGCTCC^GC^GTCGGAGGGCACCTCAG 150 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill MMIMMM III MUM lllll lllll I Mill MMIMMM I II 

Db 151 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 210 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

MM I I MMM MM! MM II III MM MMIMMIMM Ml 

Db 211 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 2 70 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGA 3 00 

I M M 1 1 II M 1 1 II II 1 1 II I M I I II II I II I M II I II II Mill 

Db 2 71 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGT 33 0 

Qy 3 01 AGAAGGAAAGGAAGTA(^CTTGAAGAACA(^GTAGAGGAAGTGC^GGAAAa^GACCTA 360 

I MMM MMIMMM MM MMM MMM MM MM 1 1 MMM MM III 

Db 331 AGAAGGAAAGGAAGTAt^TTTGAAGAAC^CAAGTAGAGGGAGTGCAGGAAAC^GAACTA 3 90 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAAC^GAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

Ml llllllll M III I IMM I II I II I lllllli II llllll 

Db 3 91 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 450 



Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAA 48 0 

II I 1 1 1 1 1 1 III! Ml 1 1 1 1 II III 1 1 1 1 II 

Db 4 51 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA- AAAAATAAGTTTGATC 504 



481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 53 0 

lllllll 1 1 1 1 I II I II I II I llllll MINIM MINIM III 

505 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 553 

RESULT 10 
AAD06398 

ID AAD06398 standard; cDNA; 517 BP. 
XX 

AC AAD063 98; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Human IGF-I isoform mechano -growth factor (MGF) cDNA. 
XX 

KW Human; IGF-I isoform; Insulin- like Growth Factor- I; MGF; 

KW mechano -growth factor; neurological disorder; neurodegenerative disorder; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post-polio syndrome; toxin; motoneurone disorder ; 

KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy; 

KW sex-linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; ss. 

XX 

OS Homo sapiens. 



XX 

-FH Key Location/Qualifiers 

FT CDS 1. .333 

FT /*tag= a 

FT /product= "Mechano -growth factor (MGF) " 

FT /note= "This region comprises exons 3-6. The CDS does 

FT not include start codon" 

FT /partial 

XX 



PN WO200136483-A1 . 
XX 

PD 25-MAY-2001. 
XX 

PF 15-NOV-2000; 2 000WO-GB043 54 . 
XX 

PR 15-NOV-1999; 99GB-0026968 . 
XX 

PA (UNLO ) UNIV COLLEGE LONDON. 
XX 

PI Goldspink G, Johnson I; 
XX 

DR WPI; 2001-355620/37. 

DR P-PSDB; AAE02447. 
XX 

PT Use of mechano -growth factor, an isoform of Insulin-like Growth 

PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 



Qy 
Db 



PS Claim 4; Page 49-50; 66pp; English. 
XX 

CC The present invention relates to use of mechano-growth factor (MGF) , 

CC an Insulin-like Growth Factor- I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 2 0% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 

CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post -polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones, motoneurone loss associated with aging, 

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease. 

CC The present sequence is human IGF-I isoform MGF cDNA . MGF is a muscle 

CC isoform having extracellular (Ec) domain, hence also referred as 

CC IGF-I-Ec. The MGF protein comprises amino acid sequences encoded by 

CC nucleic acid sequence of IGF-I exons 4, 5 and 6 in the reading frame 

CC of MGF. 

XX 

SQ Sequence 517 BP; 150 A; 130 C; 139 G; 98 T; 0 other; 

Query Match 60.3%; Score 325.2; DB 22; Length 517; 

Best Local Similarity 81.2%; Pred. No. 5.4e-82; 

Matches 429; Conservative 0; Mismatches 88; Indels 11; Gaps 4; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II MIMIMMIIMIMMMMIMMMMMMMIII 

Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAG^ 12 0 

MINIMI IMMIIMIMMM 1 1 1 1 1 1 1 M E 1 1 1 lllllllll M III < 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCC^GCAGTCGGAGGGCGCCTCAG 12 0 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill MM III II II llllllllllllllllllll 

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 18 0 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 240 

II I II lllllll I IIIMIIIIII II MM MMIMIIIMM Ml 

Db 181 TGCGCACCCCTC^GCCTGCCAAGTCAGCTCG^ 240 

Qy 241 ATGCCC^GACTCAGAAGTCCCAGCCCCTATCGACAavCAAGAAAAGGAAGCTGCAAAGG 3 00 

lllllllllllll!! lllllll III II lllllll I MM II I 

Db 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAAC^ G 2 97 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGC^ 3 60 

MMMMMMMIMI llllllllllllllllllll IIMMIIIIIIIMI III 

Db 2 98 AGAAGGAAAGGAAGTAC^TTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357 

Qy 361 CAGAATGTAGGAGGAG CCTC C CGAGGAACAGAAAATG CCACGTCACCG CAAGATCCTTTG 42 0 

III Mill II II III I Mill I II I II I lllllll II MUM 

Db 358 CAGGATGTA - GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416 



Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTC 480 

II I 1 1 1 1 1 II 1 1 1 II I 1 1 1 1 Illllll: 

Db ' 417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 470 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTC^TGAAATACACAAGTAAACAT 528 

Illllll I I I I I I M I M I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 471 ACATTT(^WVGAT-GGCATTTCCCC(^TGAAATACACAAGTAAACAT 517 



RESULT 11 
AAS16877 

ID AAS16877 standard; cDNA; 517 BP. 
XX 

AC AAS16877; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Human mechano- growth factor (MGF) cDNA. 
XX 

KW Human; mechano -growth factor; insulin-like growth factor I; IGF- I; MGF; 

KW neuroprotective; nerve damage; peripheral nervous system; nerve severing; 

KW muscle; neurological disorder; motoneuron loss; mot orneuron disorder; ss; 

KW nerve avulsion. 
XX 

OS Homo sapiens . 



XX 

FH Key Location/Qualifiers 

FT CDS 1. .333 

FT /*tag= a 

FT /product = "Human MGF" 

FT /partial 

FT /note= "No start codon" 

FT exon 1 . . 76 

FT /*tag= b 

FT /number^ 3 

FT exon 77. .259 

FT /*tag= c 

FT /number= 4 

FT exon 260 . .307 

FT /*tag= d 

FT /number = 5 

FT exon 308 . .330 

FT /*tag= e 

FT /number^ 6 

XX 



PN WO200185781-A2 . 
XX 

PD 15-NOV-2 001. 
XX 

PF 10-MAY-2001; 2001WO-GB02 054 . 
XX 

PR 10-MAY-2000; 2000GB-0011278 . 
XX 

PA (UNLO ) UNIV COLLEGE LONDON. 

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 



XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10559. 
XX 

PT Use of insulin-like growth factor I {IGF- I) isoform known as 

PT mechano -growth factor which is encoded by IGF- I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 

PS Claim 11; Fig 5; 65pp; English. 
XX 

CC The invention relates to the use of an insulin-like growth factor I 

CC (IGF-I) isoform, known as mechano -growth factor (MGF) , in the manufacture 

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration. The method is useful for. 

CC treating neurological disorders, preferably motorneuron disorders. These 

CC methods can reduce motoneuron loss by 2 0% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the human MGF. 

XX 

SQ Sequence 517 BP; 150 A; 130 C; 139 G; 98 T; 0 other; 

Query Match 60.3%; Score 325.2; DB 24; Length 517; 

Best Local Similarity 81,2%; Pred. No. 5.4e-82; 

Matches 429; Conservative 0; Mismatches 88; Indels 11; Gaps 4; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II MIMMMMMMMIM lllllllll MINIMI! 

Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCT^ 12 0 

IMIMIIMI MMMM IMIM MIMIMIMM IMMIIII II III 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

M Mill Illllllllll Ml IMMIIII I III MM II I Mill II I III Ml 

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180 

Qy 181 TGTGTCCGCTG(^GCCTAOUVAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

II I I I MUM I Illllllllll II MM II MIIIIMM III 

Db 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCG 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACA 300 

MIIIIMM I Ml III I III MM Ml II lllllllll M I 

Db 241 ATGCCCAAGACCCAGAAGTATCAGCCCCC^TCTACC^ G 297 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACAC^GTAGAGGAAGTGCAGGAAAa^GACCTA 360 

II Mill I II MMMM II Ml lllllllll I Ml II Ml II II MM II III Ml 

Db 2 98 AGAAGGAAAGGAAGTAC^TTTGAAGAAC^CAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357 

Qy 3 61 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

III Mill II II III I Mill I Ml II I MUM M IMIM 



Db 



358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416 



Qy 



421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTT 48 0 



Db 



417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 470 




Qy 



481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACAT 528 



Db 




RESULT 12 
ABT09479 

ID ABT09479 Standard; DNA; 671 BP. 
XX 

AC ABT09479; 
XX 

DT 05-DEC-2002 (first entry) 
XX 

DE Phase-1 Rat CT gene SEQ ID No 567. 
XX 

KW Rat; toxicity study; rat toxic response gene; toxicological response; 

KW drug development; phase-1 rat CT gene; ds . 

XX 

OS Rattus sp. ■ 
XX 

PN WO200266682-A2 . 
XX 

PD 29-AUG-2002. 
XX 

PF 29-JAN-2002; 2 002WO-US02 935 . 
XX 

PR 29-JAN-2001; 2 001US-264933P . 
PR 26-JUL-2001; 2 001US-308161P . 
XX 

PA (PHAS-) PHASE-1 MOLECULAR TOXICOLOGY INC . 
XX 

PI Farris G, Hicken SH, Farr SB; 
XX 

DR WPI; 2002-674961/72. 
XX 

PT Evaluating the toxicity of an agent, useful in drug development or in 
PT determining toxicological responses to a new drug, by determining the 
PT expression of rat toxicologically relevant genes in the test animal in 
PT response to the test agent 
XX 

PS Disclosure; Page 245; 388pp; English. 
XX 

CC The invention relates to a method used for evaluating the toxicity of an 
CC agent comprising determining the expression of a rat toxic response 
CC gene(s) in the test animal in response to the agent. The method is useful 
CC in drug development, particularly for conducting toxicity studies and 
CC analysis before a new drug or compound is approved for human consumption 
CC or use. The method is also useful in determining toxicological responses 
CC to a new drug. This polynucleotide sequence represents a phase-1 rat CT 
CC gene of the invention. 
XX 



SQ Sequence 671 BP; 160 A; 181 C; 180 G; 150 T; 0 other; 



Query Match 55.6%; Score 299.6; DB 24; Length 671; 

Best Local Similarity 97.1%; Pred. No. le-74; 

Matches 305; Conservative 0; Mismatches 9; Indels 0; Gaps 0; 

Qy 1 GGACCAGAGACCGTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 14 0 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAATTCGTGTGTGGACCA 199 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 12 0 

lllllll III MIMIII Mill Mill II Ml 1 1 III III II III III II llllll 1 1 

Db 2 00 AGGGGCTTTTACTTC^CAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCACAG 259 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

MMI M M M M I II I Ml I 1 1 II 1 1 MM I M 1 1 1 II III 1 1 I M I Ml M I 

Db 26 0 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 319 



Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

I I I I I II I I II II II j I I II I II II I I M I I I I II II II M I I II II I II II I I 

Db 32 0 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 37 9 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 3 00 

I II II 1 1 II 1 1 1 II 1 1 II I II 1 1 II 1 1 II I II 1 1 II 1 1 1 1 1 II I II 1 1 1 M 1 1 1 M II 1 1 

Db 38 0 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 43 9 



Qy 3 01 AGAAGGAAAGGAAG 314 

MMMIMIMM 

Db 44 0 AGAAGGAAAGGAAG 453 



RESULT 13 
AAN70436 

ID AAN70436 standard; cDNA; 818 BP. 
XX 

AC AAN70436; 
XX 

DT 25-MAR-2003 (updated) 

DT 05-APR-1991 (first entry) 

XX 

DE Sequence encoding insulin-like growth factor 1A (IGF-1A) . 
XX 

KW Growth promoter; lactation enhancer; cell proliferation; ss. 
XX 

OS Homo sapiens. 
XX 

PN EP229750-A. 
XX 

PD 22-JUL-1987. 
XX 

PF 06-JAN-1987; 87EP-0870001 . 
XX 

PR 20-NOV-1986; 86US-0929671 . 
PR 07-JAN-1986; 86US-08 16662 . 
XX 

PA (UNIW ) UNIV WASHINGTON . 
XX 



PI Krivi GG, Rotwein PS; 
XX 

DR WPI; 1987-200203/29. 
XX 

PT New pre-pro-insulin-like growth factor-1 protein - obtd. by 

PT recombinant DNA procedures for use as growth promoters for 

PT enhancing lactation, for stimulating cell proliferation etc. 
XX 

PS Example; Fig 5; 59pp; English. 
XX 

CC A 42 base oligonucleotide corresponding to the DNA sequence encoding 

CC amino acids 10 to 23 of mature human IGF-I was synthesized (AAN70437) . 

CC The radiolabeled 42 mer was then employed to screen for IGF-I 

CC~ containing DNA sequences in a human liver cDNA library. Insulin- 

CC like growth factors -1A and -IB cDNAs were isolated from a human cDNA 

CC library by using lambdagt 11 (AAN70435, AAN70436) . The human IGF-1 

CC genomic gene was isolated and mapped. It encodes at. least two 

CC preproinsulin-like growth factor-1 proteins. An essentially pure 

CC proproinsulin-1 ike growth factor-1 protein comprising the sequence 

CC of amino acids shown in Figure six is claimed (AAP70277) . 

CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 818 BP; 232 A; 186 C; 187 G; 213 T; 0 other; 

Query Match 50.9%; Score 274.2; DB 8; Length 818; 

Best Local Similarity 76.0%; Pred. No. 1.7e-67; 

Matches 4 05; Conservative . 0; Mismatches 68; Indels 60; Gaps 3; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II 1 1 MM 1 1 1 lllllll MM III I llllllll I MUM 1 1 

Db 2 03 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 262 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCAC^G 12 0 

I Mill Mill Mill I Mill I MM 1 1 MM III I Ml MM Mill II Ml 

Db 263 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 322 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill 1 1 ! 1 1 i 1 1 1 1 1 1 1 1 1 1 IN 1 1 1 1 1 I f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 323 ACAGG CATCGTGGATGAGTG CTG CTTCCGGAG CTGTGAT CTAAGGAGG CTGGAGATGTAT 3 82 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCC^TCCGGGCCCAGCGCCACACTGAC 24 0 

III II M 1 1 II I I 1 1 1 M I II 1 1 1 1 1 MM 1 1 1 1 1 J 1 1 1 1 [ 1 1 F III 

Db 3 83 TGCGCACCCCTO^GCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCG^GCGCCACACCGAC 442 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACAC^ 3 00 

MINI Ml III 

Db • 44 3 ATGCCCAAGACCCAG 457 

Qy 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 3 60 

II Mill I II I I M MM 1 1 I Mi MM I Ml Ml 1 1 Ml II 1 1 III 

Db 458 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 510 

Qy 3 61 C^GAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

III llllllll II Mill I MM I I II II I MM III Mill I Ml 

Db 511 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATG C CAC CG CAGGATC CTTTG 570 

Qy 421 - - CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAA 478 



Db 571 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCA AAAAATAAGTTTGA 624 

Qy 479 TATCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

II Mill I llllllll llllll I MM 1 1 M I II 1 1 1 I II I II I 

Db 625 TAACATTTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTCC 677 

RESULT 14 
ABT11091 

ID ABT11091 standard; cDNA; 7260 BP. 
XX 

AC ABT11091; 
XX 

DT 04-DEC-2002 (first entry) 
XX 

DE Human breast cancer associated coding sequence SEQ ID NO: 1225. 
XX 

KW Human; breast specific gene; breast cancer; differential expression; 

KW cytostatic; gene therapy ; . gene ; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200259271-A2 . 
XX 

PD 01 -AUG- 2 002. 
XX 

PF 25-JAN-2002; 2 002WO-US02176 . 
XX 

PR 25-JAN-2001; 2 001US-263757P . 

PR 25-APR-2001; 2001US-286090P . 

PR 23-MAY-2001; 2001US-292517P . 
XX 

PA (GENE-) GENE LOGIC INC. 
XX 

PI Orr MS, Nation M, Diggans JC, Zeng W; 
XX 

DR WPI; 2002-674803/72. 
XX 

PT Diagnosing breast cancer in a patient comprises detecting the level of 

PT gene expression in cell or tissue samples, where a differential gene 

PT expression is indicative of breast cancer 
XX 

PS Claim 1; SEQ ID NO 1225; 260pp + Sequence Listing; English. 
XX 

CC The present invention relates to methods of diagnosing breast cancer in a 

CC patient, which comprise detecting the level of expression in a tissue 

CC sample of two or more genes selected from those shown in ABT09867- 

CC ABT11112, where a differential expression of the genes indicates breast 

CC cancer. The methods are useful in diagnosing, treating, detecting the 

CC progression, and in monitoring treatment of breast cancer in patients. 

CC The methods' are also useful as a screening tool for agents that modulate 

CC the onset or progression of breast cancer. The breast cancer genes may be 

CC used as diagnostic markers for the prediction or identification of the 

CC malignant state of breast tissue, for confirming the type and progression 

CC of cancer, and for drug screening and assays. The present sequence is a 

CC coding sequence of the invention. 



CC Note: The sequence data for this patent did not form part of the printed 

CC ' specification, but was obtained in electronic format directly from WIPO 
CC at ftp.wipo.int/pub.published__pct_sequences. 
XX 

SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 

Query Match 50.9%; Score 274.2; DB 24; Length 7260; 

Best Local Similarity 76.0%; Pred. No. 3.6e-67; 

Matches 4 05; Conservative 0; Mismatches 68; Indels 60; Gaps 3; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

INN Mill II i 1 1 1 ill IN ! 1 1 1 i 1 1 1 II MINIM MINI 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 37 0 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGC^CCACAG 120 

I I I I I I I I I I I I I I I I II I I I I i I I I I I I I I I I I I I I II II I I I I I I II I I I I I 

Db 371 AGGGGCTTTTATTTCAAC^GCCCAC^GGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 43 0 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II 1:111 1 1 II I II I II I III 1 1 Ml I lllllllll MMMMMMMM 

Db 431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 4 90 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

III II 1 1 M 1 1 1 I MMMMM II MM I Ml Mill II I III 

Db 4 91 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGAC^ 300 

IIMMIMM Ml 

Db 551 ATGCCCAAGACCCAG 565 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGG 360 

IMMMM lllllllll MMMMM 1 1 1 1 1 1 1 1 1 1 II I II I III 

Db 566 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

III III Mill II Mill Mill I I II II I I III II I MM Mill 

Db 619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678 

Qy 421 - - CTGCTTGAGCAACCTGCAAAACATCGGAAC^ 4 78 

MM Ml Mill MM I lllllllll III II III MM I 

Db 67 9 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCA AAAAATAAGTTTGA 732 

Qy 479 TATC&TTTC^GAGATGGGCATTTC^ 531 

II Mill I I II Mill III Ml MM MMMMMMMM 1 1 III! 

Db 733 TAACATTTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTCC 785 



RESULT 15 
ABK84583 

ID ABK84583 standard; cDNA; 7260 BP. 
XX 

AC ABK84583 ; 
XX 

DT 14-AUG-2002 (first entry) 
XX 

DE Human cDNA , different ially expressed in granulocytic cells #1154. 
XX 



KW Human; ss; granulocytic cell; DNA chip; bacterial infection; 

KW viral infection; parasitic infection; protozoal infection; 

KW fungal infection; sterile inflammatory disease; psoriasis; 

KW rheumatoid arthritis; glomerulonephritis; asthma; thrombosis; 

KW cardiac reperfusion injury ;. renal reperfusion injury; ARDS; 

KW adult respiratory distress syndrome; inflammatory bowel disease ; 

KW Crohn's disease; ulcerative colitis; periodontal disease; 

KW granulocyte activation; chronic inflammation; allergy. 

XX , 

OS Homo sapiens . 
XX 

PN WO200228999-A2 . 
XX 

PD ll-APR-2002. 
XX 

PF 03-OCT-2001; 2001WO-US30821 . 
XX 

PR 03-OCT-2000; 2000US-237189P. 
XX 

PA (GENE- ) GENE LOGIC INC. 
XX 

PI Beazer-Barclay Y, Weissman SM, Yamaga S, Vockley J; 
XX 

DR WPI; 2002-435328/46. 
XX 

PT Detecting granulocyte activation by detecting differential expression 

PT of genes associated with granulocyte activation, which serves as 

PT diagnostic markers that is useful for monitoring disease states and 

PT drug toxicity 
XX 

PS Claim 1; SEQ ID No 1154; 114pp; English. 
XX 

CC The invention relates to detecting (Ml) granulocyte (GC) activation 

CC (GCA) , by detecting the level of expression of gene(s) (Gs) identified by 

CC DNA chip analysis as given in the specification, and comparing 

CC the expression level to an expression level in an unactivated 

CC GC, where differential expression of Gs is indicative of GCA. 

CC Also included, are modulating (M2) GA by contacting GC with an agent 

CC that alters the expression of at least one gene in Gs; (2) screening (M3) 

CC for an agent capable of modulating GCA or an inflammation (especially 

CC chronic) in a tissue, an allergic response in a subject, exposure of a 

CC subject to a pathogen or sterile inflammatory disease using the 

CC gene expression profile; (3) detecting (M4) an inflammation (especially 

CC chronic) in a tissue, an allergic response in a subject, exposure of a 

CC subject to a pathogen or sterile inflammatory disease, by detecting the 

CC level of expression in a sample of the tissue of gene(s) from Gs, where 

CC the level of expression of the gene is indicative of inflammation; 

CC (4) treating (M5) an inflammation (especially chronic) or in a tissue, 

CC an allergic response in a subject, exposure of a subject to a pathogen 

CC or sterile inflammatory disease, by contacting a tissue having 

CC inflammation with an agent that modulates the expression of gene(s) 

CC from Gs in the tissue. Ml is useful for detecting GCA; M2 is useful for 

CC modulating GA; M3 is useful for screening an agent capable of modulating 

CC GCA preferably in an inflammation in a tissue; M4 is useful for 

CC detecting an inflammation (especially chronic) in a tissue, an allergic 

CC response in a subject, exposure of a subject to a pathogen or sterile 

CC inflammatory disease (e.g. psoriasis, rheumatoid arthritis, 



CC glomerulonephritis, asthma, thrombosis, cardiac reperfusion injury, renal 

CC reperfusion injury, ARDS, adult respiratory distress syndrome, 

CC inflammatory bowel disease, Crohn's disease, ulcerative colitis, 

CC periodontal disease; also bacterial infection, viral infection, 

CC parasitic infection, protozoal infection, fungal infection and M5 is 

CC useful for treating one of the above conditions. The present 

CC sequence represents a gene differentially expressed in granulocytes. 

CC Note: The sequence data for this patent did not form part 

CC of the printed specification, but was obtained in electronic 

CC format directly from WIPO at 

CC ftp.wipo. int/pub/published_pct_sequences . 

XX 

SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 



Query Match 50.9%; Score 274-2; DB 24; Length 7260; 

Best Local Similarity 76.0%; Pred. No. 3.6e-67; 

Matches 4 05; Conservative 0; Mismatches 68; Indels 60; Gaps 3; 



Qy 


l 


GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 

Mill MM! II II III III II III II Mill III IIIIIIIIIII lllllll 

GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 


60 


Db 


311 


370 


Qy 


61 


AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTC 

IMMI Mill MUM II Mill III 1 llllll MIMI Mill MM II III 

AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 


120 


Db 


371 


430 


Qy 




ALULjUL-A 1 1 Li 1 (aUA 1 OALr Ibll CjL 1 1 LLLiLjAUL. 1 Cj 1 CjA 1 L 1 UAObAObL 1 GbAbA 1 (j 1 AL 

M Mill MMMIIII! MM llllll MMMIIII II IMMI MIMI III 

ACAGGCAT CGTGGATGAGTG CTG CTTC CGGAG CTGTGATCTAAGGAGG CTGGAGATGTAT 


18 0 


Db 


431 


490 


Qy 


181 


TGTGTCCGCTGCAAGCCTAO^GTCAGCTCGTTCCATCCGGGCCCAGCGCC^CACTGAC 


240 


Db 


491 


III II lllllll 1. MINIM II MM Mill IMM III 

TGCGC^CCCCTCAAGCCTGCC^GTC^GCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 


550 


Qy 


241 


ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAA^ 


300 


Db 


551 


MIIMIMM III 

ATGCCCAAGACCCAG 


565 


Qy 


301 


AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGC^ 

IIIIIIIIMI MM M MIMIIIM IIIIIIIIIIIIIMI III 

. -AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 


360 


Db 


566 


618 


Qy 


361 


CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 

III MINIM II Mill Mill lllllll lllllll lllllllll 

CAGGATGTAGGAAGAC CCTCCTGAGGAGTGAAGAGTGACATGCCACCG CAGGATCCTTTG 


420 


Db 


619 


678 


Qy 


421 


- - CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTt^ 

1 II 1 II 1 II II 1 II II 1 II 1 1 1 II 1 1 II 1 lllllllll 1 

CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCA AAAAATAAGTTTGA 


478 


Db 


679 


732 


Qy 


479 


TATCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

II Mill 1 1 lllllll llllll IIIIIIIIIIIIIIIIIIIIIIIIII 

TAACATTTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTCC 785 




Db 


733 
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ALIGNMENTS 



RESULT 1 

US-09-142-583A-3 

; Sequence 3, Application US/09142583A 
; Patent No. 6221842 

GENERAL INFORMATION: 

APPLICANT: GOLDS PINK, GEOFFREY 

TITLE OF INVENTION: METHOD OF TREATING MUSCULAR DISORDERS 
NUMBER OF SEQUENCES: 11 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: NIXON & VANDERHYE P.C. 

STREET: 1100 NORTH GLEBE ROAD 

CITY: ARLINGTON 

STATE: VA 

COUNTRY: USA 

ZIP: 22201 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 



OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE : Pa tent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

. APPLICATION NUMBER: US/ 09/142 , 583A 

FILING DATE: 29-Oct-1998 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB97/00658 

FILING DATE: ll-MAR-1997 

APPLICATION NUMBER: GB 9605124.8 

FILING DATE: 11 -MAR- 19 96 
ATTORNEY/AGENT INFORMATION: 

NAME: SADOFF, B. J. 

REGISTRATION NUMBER: 36663 

REFERENCE / DOCKET NUMBER: 117-263 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 7038164000 

TELEFAX: 7038164100 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 553 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : both 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE: 

NAME /KEY: CDS 

LOCATION: 1. .363 
SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
US-09-142-583A-3 



Query Match 66.2%; Score 356.8; DB 3; Length 553; 

Best Local Similarity 82.3%; Pred. No. 2.5e-109; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II Mill IIIIIIIIIIIIII MIMIIIIIIIIIIIIIIII 

Db 31 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 90 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTC 12 0 

I I I II I I II I I II I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I II I I I II 

Db 91 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 150 



Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill I! I I I I I I i! I I I I: I I I I I I I I I I I I M M I I I Ml I III I I I 

Db 151 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 210 



Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

MM II llllll Mill MM II III MM II II MUM 1 1 II III 

Db 211 TGTGOVCCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 270 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGAC^ 3 00 

MIMMMMIMIMM lllllllllll II II I II II I II I II III 

Db 271 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCA^ 330 

Qy 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGG 360 

I III 1 1 MM II MM II MMMMMMIMIMM IIIIIIIIIIIIII II III 



Db 



331 AGAAGGAAAGGAAGTACATTTGAAGAAC^t^GTAGAGGGAGTGCAGGAAAC^GAACTA 3 90 



QY 



3 61 CAGAATGTAGGAGGAGC CT CCCGAGGAACAGAAAATG CCACGTCACCG CAAGATCCTTTG 42 0 



Db 




Db 



Qy 



421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 480 

II I MINI 1 1 1 1 III 1 1 1 1 II III 1 1 1 1 II 

451 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 504 



Db 



Qy 




RESULT 2 

US-09-142-583A-5 

; Sequence 5, Application US/09142583A 
; Patent No. 6221842 

GENERAL INFORMATION: 

APPLICANT: GOLDSPINK, GEOFFREY 

TITLE OF INVENTION: METHOD OF TREATING MUSCULAR DISORDERS 
NUMBER OF SEQUENCES: 11 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: NIXON & VANDERHYE P.C. 

STREET: 1100 NORTH GLEBE ROAD 

CITY: ARLINGTON 

STATE: VA 

COUNTRY: USA 

ZIP: 22201 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/142 , 583A 

FILING DATE: 29-Oct-1998 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB97/ 00658 

FILING DATE: ll-MAR-1997 

APPLICATION NUMBER: GB 9605124.8 

FILING DATE: ll-MAR-1996 
ATTORNEY/AGENT INFORMATION: 

NAME: SADOFF, B. J. 

REGISTRATION NUMBER: 36663 

REFERENCE/DOCKET NUMBER: 117-263 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 7038164000 

TELEFAX: 7038164100 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 553 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: both 

TOPOLOGY: linear 



MOLECULE TYPE: cDNA 
FEATURE : 

NAME/ KEY: CDS 
LOCATION: 341..397 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-142-583A-5 

Query Match 66.2%; Score 356.8; DB 3; Length 553; 

Best Local Similarity 82.3%; Pred. No. 2.5e-109; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II Mill II III III II III I I I M I I I I' 

Db 31 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 90 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCC^ 12 0 

I ! 1 1 1 1 1 1 1 1 1 IMIIIIMIIMIII II IMIIIIIII IIIIIIMIMI Ml 

Db 91 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGG 150 

Qy 121 ACGGG CATTGTGGATGAGTGTTGCTTC CGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill MIIMIIMI IIMIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIMI 

Db 151 ACAGG CATCGTGGATGAGTG CTG CTT CCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 210 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCC^ 24 0 

MM II MM II 1 1 1 1 1 Ml! II III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 

Db 211 TGTGCACCCCTCAAGCCGGCAAAGGC^GCCCGCTCCGTCCGTGCCC^GCGCCACACCGAC 270 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACAC^ 3 00 

MMMMIMMMIMI Mill I III II ' 1 1 1 1 1 1 1 1 1 MM II III 

Db 271 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACC^ 330 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 360 

MMIMMMMIMM MIIIIIIIIIIIMMIM 1 1 M 1 1 1 1 M 1 1 1 1 1 1 III 

Db 331 AGAAGGAAAGGAAGTAC^TTTGAAGAAC^CAAGTAGAGGGAGTGC^GGAAAaUlGAACTA 3 90 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGayVGATCCTTTG 42 0 

III IIIIIMI II III I Mill I II I II I 1 1 I i 1 1 1 i 

Db 3 91 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGG C CACCG CAGGAC CCTTTG 4 50 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAAm^ 4 80 

II I Mill I MM III MM II III 1 1 1 1 II 

Db 451 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 504 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 530 

1 1 M 1 1 1 MM llllllllll I II IM N 1 1 1 1 1 1 1 Ml II I 

Db 505 ACATTTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 553 



RESULT 3 

US-09-142-583A-10 

; Sequence 10, Application US/09142583A 
; Patent No. 6221842 

GENERAL INFORMATION : 

APPLICANT: GOLDSPINK, GEOFFREY 

TITLE OF INVENTION: METHOD OF TREATING MUSCULAR DISORDERS 
NUMBER OF SEQUENCES: 11 
CORRESPONDENCE ADDRESS : 



ADDRESSEE: NIXON & VANDERHYE P.C. 

STREET: 1100 NORTH GLEBE ROAD 

CITY: ARLINGTON 

STATE: VA 

COUNTRY: USA 

ZIP: 22201 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/ 142 , 583A 
-FILING DATE: 29-Oct-1998 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB97/00658 

FILING DATE: ll-MAR-1997 

APPLICATION NUMBER: GB 9605124.8 

FILING DATE: ll-MAR-1996 
ATTORNEY/AGENT INFORMATION: 

NAME: SADOFF, B. J . 

REGISTRATION NUMBER: 36663 

REFERENCE/DOCKET NUMBER: 117-263 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 7038164000 

TELEFAX: 7038164100 
INFORMATION FOR SEQ ID NO: 10: 
SEQUENCE CHARACTERI STI CS : 

LENGTH: 777 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : both 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/ KEY: CDS 

LOCATION: 26 . .493 
SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
US-09-142-583A-10. 

Query Match 50.6%; Score 272.6; DB 3; Length 777; 

Best Local Similarity 75.8%; Pred. No. 4.3e-81; 

Matches 404; Conservative 0; Mismatches 69; Indels 60; Gaps 3; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 



Db 




Qy 



61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTC 120 



Db 




QY 



121 ACGGGCATTGTGGATGAGTGTTGCTT CCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 180 



Db 




Qy 



181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 



Ml II 1 1 II III I III II I II III II MM MMIMMIMM III 

5 9 TGCGC^CCCCTCAAGCCTGC(^GTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 



Db 


359 


TGCGCACCCCTCAAGCCTGCf^ 


418 


Qy 


241 


ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACAC^C^GAAAAGGAAGCTGCAAAGG 


300 


Db 


419 


INI III MM Ml 

ATGCCCAAGACCCAG 


433 


Qy 


301 


AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTC 


360 


Db 


434 


IMMMIMI II II MM 1 IMIMIMI MMMMMIIMM Ml 

AAGGAAGTACATTTGAAGAACG CAAGTAGAGGGAGTG CAGGAAACAAGAACTA 


486 


Qy 


361 


CAGAATGTAGGAGGAG CCT CCCGAGGAACAGAAAATG C CACGTCAC CG CAAGAT CCTTTG 
III 1 II 1 1 1 1 1 1 1 II 1 1 1 1 II II 1 1 M II 1 1 1 1 II II II II 1 II M 


420 


Db 


487 


1 11 1 1 II 1 1 1 1 II Mill 1 1 1 1 1 1 1 It II 1 1 1 1 1 1 1 t 1 1 1 1 1 l l l l 

CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 


546 


Qy 


421 


- - CTGCTTGAGCAACCTGCAAAACA.TCGGAACACCTGCCAAATATCAATAATGAGTTCAA 


478 


Db 


547 


MM III MM! MM 1 lllllllll Ml 1 1 Ml MM 1 

CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCA AAAAATAAGTTTGA 


600 


Qy 


479 


TATCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 




Db 


601 


II Mill 1 MMMM MUM IIMIMIIMIIIIIIIIMMIII 

TAACATTTAAAAGATGGG CG TTT C CC C CAATGAAATACA CAAGTAAACATT C C 653 





RESULT 4 
5405942-2 

; Patent No. 5405942 

APPLICANT: BELL, GRAEME I . ; RALL , LESLIE B, ; MERRYWEATHER , 
; JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
;I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984 
;SEQ ID NO: 2: 

LENGTH: 622 
5405942-2 

Query Match 50.3%; Score 271; DB 6; Length 622; 

Best Local Similarity 61.2%; Pred. No. 1.3e-80; 

Matches 326; Conservative 77; Mismatches 70; Indels 60; Gaps 3; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill h MMMIhlMHhlll M M : M I h M h h h I M 

Db 45 GGACCGGAGACGCUCUGCGGGGCUGAGCUGGUGGAUGCUCUUCAGUUCGUGUGUGGAGAC 104 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGA 120 

lllllh:::| -llllllllllllll =h II hi II I II MM I II II II III 

Db 105 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUA 164 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

M II h MM MM hi : Mil II hi III h hi. 

Db 165 ACAGGUAUCGUGGAUGAGUGCUGCUUCCGGAGCUGUGAUCAUAGGAGGCUGGAGAUGUAU 224 



Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

Ml I I lllllh I llhlllhll =1 :||| MIMIIIIIIIII Ml 

Db 225 UGCGC^CCCCUC^GCCUGCCAAGUCAGCUCGCUCUGUCCGUGCCCAGCGCCACACCGAC 284 

Qy 241 ATGCCCAAGAC^CAGAAGTCCCAGCCCCTATCGACACAC^^ 300 

hllMIIMI I I I 

Db 285 AUGCCCAAGACCCAG 299 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAAC^C^GTAGAGGAAGTGCAGGAAAa^GACCTA 360 

llllllhlll ■■Mill lllhlllll IhlMMMIIMM hi 

Db 300 AAGGAAGUACAUUUGAAGAACG CAAGUAGAGGGAGUG CAGGAAACAAGAACUA 352 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

Ml hhllll II Ihll Mill I I =1 II I lllllll 

Db 3 53 CAGGAUGUAGGAAGACC CUCCUGAGGAGUGAAGAGUGACAUGC CACCG CAGGAUC CUUUG 412 

Qy 421 --CTGCTTGAGO^CCTGCAAAAC^ 478 

hi III Mhl MM : I III III h Ml II lh lh= I 

Db 413 CUCUGCACGAGUUACCUGUUAAACUUUGGAACACCUACCA AAAAAUAAGUUUGA 466 



Qy 47 9 TATCATTTCAGAGATGGGC^TTTCCCTC^TGAAATAC^a^GTAAACATTCC 531 

= I lh:: | ||h|||| :::||| | | |: || | |: || | | | | |: | | || J: : | | 
Db 4 67 UAACAUUUAAAAGAUGGGCGUUUC CC CCAAUGAAAUAC^ 519 



RESULT 5 

US-08-472-809B-8 

; Sequence 8, Application US/08472809B 

; Patent No. 5925564 

; GENERAL INFORMATION: 

APPLICANT: Schwartz, Robert J. 

APPLICANT: DeMayo , Franco J. 

APPLICANT: O'Malley, Bert W. 

TITLE OF INVENTION: Expression Vector Systems and 
TITLE OF INVENTION: Method of Use 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Lyon & Lyon 

STREET: 633 West Fifth Street 

STREET : Suite 470 0 
; CITY: Los Angeles 

STATE: California 

COUNTRY: U.S.A. 

ZIP: 90071-2066 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5" Diskette, 1.44 Mb 

MEDIUM TYPE: storage 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: IBM P.C. DOS 5.0 

SOFTWARE: Word Perfect 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/472 , 809B 

FILING DATE: June 7, 1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/209,846 



FILING DATE: March 9, 1994 

APPLICATION NUMBER: 07/789,919 

FILING DATE: No . 5925564ember 6, 1991 
ATTORNEY/AGENT INFORMATION: 

NAME: Warburg, Richard J. 

REGISTRATION NUMBER: 32,3 27 

REFERENCE/DOCKET NUMBER: 214/212 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (213) 489-1600 

TELEFAX: (213) 955-0440 

TELEX: 67-3510 
; INFORMATION FOR SEQ ID NO: 8: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 5707 bases 

TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 
MOLECULE TYPE: CDNA 
US-08-472-809B-8 

Query Match 41.2%; Score 222; DB 2; Length 5707; 

Best Local Similarity 75:5%; Pred. No. l.le-63; 

Matches 314; Conservative 0; Mismatches 50; Indels 52; Gaps 1; 



Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill M MM I II III III II III II M I !! I ! 1 1 1 1 1 1 1 1 1 1 i 1 1 1 

Db 7 93 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 8 52 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTC 12 0 

II, llllll MINIMI III : 1 1 ■ 1 1 1 1 1 1 1 1 1 MINIM II III 

Db 8 53 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 912 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

M MIM MIIMII M I II Ml I II 1 1 II M 1 . 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 

Db 913 ACAGG CAT CGTGGATGAGTGCTG CTTCCGGAG CTGTGATCTAAGGAGGCTGGAGATGTAT 972 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

II I I I II i 1 I II! 1 1 i 1 1 1 1 II MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 

Db 973 TGCGCACCCCTCAAGCCTGCC^AGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 1032 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACAC 300 

IIIIIIIIMI III 

Db 1033 ATGCCCAAGACCCAG 1047 

Qy 3 01 AGAAGGAAAGGAAGTAOVCTTGAAGAACAO^GTAGAGGAAGTGCAGGAAACAAGACCTA 360 

MIIMIMM I i 1 1 1 1 1 1 1 I i 1 1 1 1 1 1 1 i 1 1 M 1 1 1 III 

Db 1048 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 1100 

Qy 361 CAGAATGTAGGAGGAGCCT CC CGAGGAACAGAAAATG C CACGTCACCG CAAGATC C 416 

III I Mill M Mill Mill I II II I I Mill III I 

Db 1101 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCC 1156 



RESULT 6 

US-08-472-809B-7 

; Sequence 7, Application US/08472809B 
; Patent No. 5925564 



; GENERAL INFORMATION: 

APPLICANT: Schwartz, Robert J . 
APPLICANT: DeMayo, Franco J. 
APPLICANT: O'Malley, Bert W. 

TITLE OF INVENTION: Expression Vector Systems and 
TITLE OF INVENTION: Method of Use 
NUMBER OF SEQUENCES : 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Lyon & Lyon 

STREET: 633 West Fifth Street 

STREET: Suite 4700 

CITY: Los Angeles 

STATE: California 
■ COUNTRY: U.S.A. 

ZIP: 90071-2066 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5" Diskette, 1.44 Mb 
; MEDIUM TYPE: storage 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: IBM P.C. DOS 5.0 

SOFTWARE: Word Perfect 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08 /472 , 8 09B 

FILING DATE: June 7, 1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/209,846 

FILING DATE: March 9, 1994 

APPLICATION NUMBER: 07/78 9,919 

FILING DATE: No. 5925564ember 6, 1991 
ATTORNEY/AGENT INFORMATION: 

NAME: Warburg, Richard J . 

REGISTRATION NUMBER: 32,327 

REFERENCE/DOCKET NUMBER: 214/212 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (213) 489-1600 

TELEFAX:" (213) 955-0440 

TELEX: 67-3510 
; INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 6345 bases 

TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-472-809B-7 

Query Match 41.2%; Score 222; DB 2; Length 634 5; 

Best Local Similarity 75.5%; Pred. No. 1.2e-63; 

Matches 314; Conservative 0; Mismatches 50; Indels 52; Gaps 1; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 6 0 

Mill Mill II i I I I M l I IMIIIMMMMMMMI 

Db 3702 GGACCGGAGACGCT CTG CGGGG CTGAG CTGGTGGATGCTCTT CAGTTCGTGTGTGGAGAC 3 761 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCC^ 12 0 

1 1 1 1 1 1 1 1 II I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIMIMIIMM 1 1 Ml II 1 1 II III 



Db 



3762 AGGGGCTTTTATTTCAA(^GCCC^CAGGGTATGGCTCC^GCAGTCGGAGGGCGCCTCAG 3821 



Qy 121 ACGGGCATTGTGGATGAGTGTTG CTT C CGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

I I 1 11 M lllllllllll I i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 II Mill III II I II II 

Db 3 822 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 38 81 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCGCAGCGCCACACTGAC 24 0 

Ml II lllllll I MIMMIMI II MM IIMIIIIMIMI III 

Db 3882 TGCGCACCCCTCAAGCCTGCCAAGTGAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 3 941 

Qy 241 ATG CCCAAGACT CAGAAGTC CCAG CCCCTATCGACACACAAGAAAAGGAAGCTG CAAAGG 3 00 

lllllllllll III 

Db 3942 ATGCCCAAGACCCAG 3956 

Qy 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 3 60 

IIIIMIIIII lllllllll IIIIIIIMI lllllll llllllll III 

Db 3 957 AAGGAAGTACA.TTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 4 009 

Qy 361 OVGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCC 416 

III IIIMMI II Mill Mill I I II M I MM III Mill 

Db 4 010 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCG CAGGATC C 4 065 



RESULT 7 
5405942-9 

/Patent No. 5405942 

APPLICANT: BELL, GRAEME I.;RALL, LESLIE B . ; MERRYWEATHER , 
; JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
;I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 63 0,557 
FILING DATE: 19-JUL-1984 
;SEQ ID NO: 9: 

LENGTH: 3 57 
5405942-9 

Query Match 38.1%; Score 205.2; DB 6; Length 357; 

Best Local Similarity 70.2%; Pred. No. 9e-59; 

Matches 181; Conservative 44; Mismatches 33; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill h MlllllhlllhlhlllllhhMlhMhhhIM 

Db 43 GGACCGGAGACGCUCUG CGGGG CUGAG CUGGUGGACGCUCUUCAGUUCGUGUGUGGAGAC 102 



Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 12 0 

|||l||::::| ^llllllllllllll M M I h MM I I M I M I I I I II I I I 
Db 103 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUAUGGCUCf^ 162 



Qy 121 ACGGGCATTGTGGATGAGTGTTG CTT CCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 18 0 

II II h I = I : : 

Db 163 ACAGGUAUCGUGGAUGAGUGCUGUUUCCGGAG CUGUGAUCUAAGGAGG CUGGAGAUGUAU 222 



Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

•MM IMIIIM MhlilhlMI MM M : ' 1 1 M 1 1 1 1 1 1 IN 

Db 223 UGCGCACCCCUCAAGCCUGCCAAGUCAGCUCGCUCUGUCCGUGCCCAGCGCCACACCGAC 282 

Qy 241 ATGCCCAAGACTCAGAAG 258 

: I I M I I I llllll 

Db 283 AUGCCCAAGACCCAGAAG 300 



RESULT 8 

5405942-13 

/Patent No. 5405942 

APPLICANT: BELL, GRAEME I.;RALL, LESLIE B . ; MERRYWEATHER , 
; JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN -LIKE GROWTH FACTORS 
;I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984 
;SEQ ID NO: 13 : 

LENGTH: 357 
5405942-13 

Query Match 37.8%; Score 2 03.6; DB 6; Length 3 57; 

Best Local Similarity 86.8%; Pred. No. 3.1e-58; 

Matches 224; Conservative 0; Mismatches 34; Indels 0; Gaps 0; 



Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill [I 1 1 1 1 1 1 i i I 1 1 1 1 1 1 1 ; : I I II 1 1 1 1 1 1 1 1 1 1 Ml 

Db 43 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 102 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 12 0 

lllllllll IMIIM MM I llllll llllll IMIIM Mill MM MM 

Db 103 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 162 

Qy 121 ACGGG CATTGTGGATGAGTGTTG CTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 180 

M II M MMMIIMI I llllll llllll II II III 1 1 1 i 1 1 1 1 1 i 1 1 1 1 1 1 1 

Db 163 ACAGGTATCGTGGATGAGTG CTG CTT CCGGAG CT.GTGAT CTAAGGAGG CTGGAGATGTAT 222 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCC^ 24 0 

III II llllllllllllllll I I I I E I I I I I I I I I i I 1 I III 

Db 223 TGCGO^CCCCTCAGGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 2 82 

Qy 241 ATGCCCAAGACTCAGAAG 2 58 

MMMIIMI WWW 

Db 2 83 ATGCCCAAGACCCAGAAG 3 00 



RESULT 9 
5405942-7 

; Patent No. 54 05942 

APPLICANT: BELL, GRAEME I . ; RALL , LESLIE B .; MERRYWEATHER , 
; JAMES P. 



TITLE OF INVENTION: PREPRO INSULIN- LIKE GROWTH FACTORS 
I AND II" 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 63 0,557 
FILING DATE: 19-JUL-1984 
SEQ ID NO: 7: 

LENGTH: 210 
5405942-7 

Query Match 30.9%; Score 166.8; DB 6; Length 210; 

Best Local Similarity 67.6%; Pred. No. 4.5e-46; 

Matches 142; Conservative 41; Mismatches 27; Indels 0; Gaps 0; 

Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

INI! Mill h MMMMMIMMIMM IhhMlhMIMMMII 
Db 1 GGAC CGGAGACG CUCUGCGGGG CUGAGCUGGUGGAUG CUCUUCAGUUCGUGUGUGGAGAC 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGC 120 

lllh-. ::|MIIIIIIIIIII MMMMIMM MINIMI II Ml 
Db 61 AGGGGCUUUUAUUU(^CAAGCCCA(^GGGUA 120 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Illlh IMIIMIIM : I h : I i 1 I I I I : I : I I : I : I I I I I II M I I I I M M 
Db 121 ACAGGUAUCGUGGAUGAGUGCUGCUUCCGGAGCUGUGAUCUAAGGAGGCUGGAGAUGUAU 180 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCT 210 

Ml II lllllh I 

Db 181 UGCGCACCCCUCAAGCCUGCCAAGUCAGCU 210 



RESULT 10 

5405942-11 

; Patent No. 5405942 

APPLICANT: BELL, GRAEME I . ; RALL , LESLIE B . ; MERRYWEATHER , 
; JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN- LIKE GROWTH FACTORS 
;I AND II 

NUMBER OF SEQUENCES : 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19- JUL-1984 
;SEQ ID NO: 11: 

LENGTH: 210 
5405942-11 



Query Match 30. 9%; 

Best Local Similarity 87.1%; 
Matches 183; Conservative 



Score 166.8; DB 6; Length 210; 
Pred. No. 4.5e-46; 
0; Mismatches 27; Indels 0; Gaps 0; 



Qy 



1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 



I Ml I Mill 1 1 1 1 1 1 1 M 1 1 1 1 1 1 ' 1 1 MMMMIMMIMIMM 

Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 12 0 

MM I MM II llllllllllllllll 1 . ! 1 1 1 1 1 1 1 1 1 1 III M MM II III 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

II II II Illllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i II Mill I Mill II II 

Db 121 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCT 210 

III I I MINI I I III I MM 

Db 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCT 210 



RESULT 11 
US-09-255-829-13 

; Sequence 13, Application US/09255829 

; Patent No. 6461617 

; GENERAL INFORMATION: 

APPLICANT: Shone, Clifford Charles 

APPLICANT: Quinn, Conrad Padraig 

APPLICANT: Foster, Keith Alan 

TITLE OF INVENTION: . Recombinant Toxin Fragments 
NUMBER OF SEQUENCES: 2 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: STERNE, KESSLER, GOLDSTEIN, & FOX P.L.L.C. 

STREET:. 1100 NEW YORK AVENUE, NW, SUITE 600 

CITY: WASHINGTON 

STATE: DC 

COUNTRY: USA 

ZIP: 20005-3934 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patent In Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/255 , 829 

FILING DATE: 23-FEB-1999 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/GB97/02273 

FILING DATE: 22-AUG-1997 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/782,893 

FILING DATE: 27-DEC-1996 
ATTORNEY/AGENT INFORMATION: 

NAME: ESMOND, ROBERT W. 

REGISTRATION NUMBER: 32,893 

REFERENCE / DOCKET NUMBER: 1581.0130002 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-371-2600 

TELEFAX: 202-371-2540 
; INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 28 62 base pairs 



TYPE: nucleic acid 
STRANDEDNESS : doubl e 
TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
FEATURE : 

NAME/ KEY: CDS 
LOCATION: 1. .2862 
US-09-255-829-13 

Query Match 30.9%; Score 166.8; DB 4; Length 2862; 

Best Local Similarity 87.1%; Pred. . No . '■ 2 . le-45 ; 

Matches 183; Conservative 0; Mismatches 27; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II I ! M 1 1 1 1 1 1 II 1 1 1 IIIIIIIIMIII Mill 

Db 2644 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 27 03 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACA^ 12 0 

IIIIIIIIMI HIM Illllll III IIIIIIIMIIIII.il II ill 

Db 27 04 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTC^ 2763 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

M M II Illllll I I I f I i t i I I I I I E I I I t I I Illllll Illllll 

Db 2764 ACAGGTAT CGTGGATGAGTG CTG CTTCCGGAG CTGTGAT CTAAGGAGG CTGGAGATGTAT 2823 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCT 210 

II I I I Illllll I lllllllll 

Db 2824 TGCGCACCCCTCAAGCCTGCCAAGTCAGCT 2853 



RESULT 12 

5405942-15 

;Patent No. 5405942 

APPLICANT: BELL, GRAEME I . ; RALL , LESLIE B . ; MERRYWEATHER , 
; JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
;I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 63 0,557 
FILING DATE: 19 -JUL- 1984 
;SEQ ID NO: 15: 

LENGTH: 210 
5405942-15 

Query Match 30.6%; Score 165.2; DB 6; Length 210; 

Best Local Similarity 67.1%; Pred. No. 1.5e-45; 

Matches 141; Conservative 41; Mismatches 28; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill h MlllllhilihIMIM 

Db 1 GGACCGGAGACGCUCUGCGGGGCUGAGCUGGUGGAUGCUCUUCAGUUCGUGUGUGGAGAC 60 



QY 



61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 12 0 



I Ml II-- I : I I I I :hllhliril| U II Mill II III 

Db 61 AGGGGCUUUUAUIJUCAACAAGCCCAC^GGGUAU^ 120 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II M h hllhllhl U h : I I I I I I h I : I I : I : IIIIIIIMMIIMM 

Db 121 ACAGGUAUCGUGGAUGAGUG CUG CUUCCGGAG CUGUGAUCUAAGGAGGCUGGAGAUGUAU 18 0 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCT 210 

Ml II M h I I hlllh 

Db 181 UGCGCACCCCUCAGGCCUGCCAAGUCAGCU 210 



RESULT 13 
US-08-308-196A-1 

Sequence 1, Application US/08308196A 
Patent No. 5612198 
GENERAL INFORMATION: 

APPLICANT: Brierley, Russell A. 
APPLICANT: Davis, Geneva R. 
APPLICANT: Holtz, Gregory C. 
APPLICANT: Gleeson, Martin A. 
APPLICANT: Howard, Bradley D. 

TITLE OF INVENTION: Production of Insulin-Like Growth 
TITLE OF INVENTION: Factor-1 in Methyl otrophic Yeast Cells 
NUMBER OF SEQUENCES : 17 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Brown, Martin, Haller & McClain 
STREET: 1660 Union Street 
CITY: San Diego 
STATE: California 
COUNTRY : USA 
ZIP: 92101-2926 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/3 08 , 196A 
FILING DATE: 09-SEPT-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/983,523 
FILING DATE: 03-MAR-1993 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/578,728 
FILING DATE: 04-SEP-1990 
ATTORNEY/AGENT INFORMATION: 
NAME: Seidman, Stephanie L. 
REGISTRATION NUMBER: 33,779 
REFERENCE / DOCKET NUMBER: 518 75 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: ( 6 1 9 ) 23 8 - 0 9 9 9 
TELEFAX: (619)238-0062 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 24 0 base pairs 
TYPE: nucleic acid 



STRANDEDNESS : double 
TOPOLOGY: unknown 
MOLECULE TYPE : cDNA 
FEATURE: 

NAME/ KEY: CDS 
LOCATION: 14.. 232 
US-08-308-196A-1 

Query Match 29.9%; Score 161.2; DB 1; Length 240; 

Best Local Similarity 84.6%; Pred. No. 3.6e-44; 

Matches 181; Conservative 0; Mismatches. 33; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II Mill 1 1 MM III Mill Mill MM lllllllllll 

Db 17 GGACCGGAGACGCTCTGCGGGGCTGAGCTCGTGGATGCTCTGCAGTTCGTGTGTGGAGAC 76 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATC^ 120 

llllllll MM I; 1 1 1 ■ 1 1 II I 1 1 1 1 1 1 E 1 1 1 1 1 1 III MM II III 

Db 7 7 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCC^GCAGTCGACGGGCGCCTCAG 136 

Qy 121 ACGGG CATTGTGGATGAGTGTTG CTT CCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 180 

II MM I II llllllll IMMMM llllll II II Ml I llllllll 

Db "13 7 ACAGG CAT CGTGGATGAGTG CTG CTT CCGGAG CTGTGATCTAAGGAGGCT CGAGATGTAT 196 

Qy 181 TGTGTC CG CTG CAAG CCTACAAAGTCAGCTCGTT 214 

III II I II I II I I 1 1 II 1 1 1 M 1 I 

Db 197 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTTGAT 23 0 



RESULT 14 
PCT-US91-06452-1 

Sequence 1, Application PC/TUS9106452 
GENERAL INFORMATION: 

APPLICANT: Brierley, Russell A. 
APPLICANT: Davis, Geneva R. 
APPLICANT: Holtz, Gregory C. 
APPLICANT: Gleeson, Martin A. 
APPLICANT: Bradley, D. H. 

TITLE OF INVENTION: Production of Insulin-Like Growth 
TITLE OF INVENTION: Factor-1 in Methyl otrophic Yeast Cells 
NUMBER OF SEQUENCES : 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fitch, Even, Tabin & Flannery 
STREET: 135 South LaSalle Street, Suite 900 
CITY: Chicago 
STATE: Illinois 
COUNTRY: USA 
ZIP: 60603- 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 
SOFTWARE : . Patent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US91/06452 
FILING DATE: 19910409 
CLASSIFICATION: 435 



PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/578,728 
FILING DATE: 04-SEP-1990 
ATTORNEY/ AGENT INFORMATION: 
NAME: Seidman, Stephanie L. 
REGISTRATION NUMBER: 33,779 
REFERENCE/DOCKET NUMBER: 51874 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (619)552-1311 
TELEFAX: ( 6 1 9 ) 552 - 0 095 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 240 base pairs 
TYPE: NUCLEIC ACID 
STRANDEDNESS : doubl e 
TOPOLOGY : unknown 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME /KEY: CDS 
LOCATION: 14. .232 
PCT-US91-06452-1 

Query Match 29.9%; Score 161.2; DB 5; Length 240; 

Best Local Similarity 84.6%; Pred. No. 3.6e-44; 

Matches 181; Conservative 0; Mismatches 33; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II I III III III MM III II Mill Mill MINI MM 

Db 17 GGACCGGAGACGCTCTGCGGGGCTGAGCTCGTGGATGCTCTGCAGTTCGTGTGTGGAGAC 76 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTAT 12 0 

Illllllllll MMMMMIIMM MIMIMIMM III MM II III 

Db 77 AGGGGCTTTTATTTC^^CAAGCCCACAGGGTATGGCTCCAGCAGTCGACGGGCGCCTCAG 136 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill III Mill II Mllllllll MM MIIIMI 1 1 1 1 1 1 1 1 

Db 13 7 ACAGGCAT CGTGGATGAGTGCTG CTTCCGGAG CTGTGAT CTAAGGAGG CTCGAGATGTAT 196 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTT 214 

II I II I I M I II I I II I II I I I I I 

Db 197 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTTGAT 23 0 



RESULT 15 
US-09-029-267-13 

Sequence 13, Application US/09029267 
Patent No. 6107057 
GENERAL INFORMATION : 

APPLICANT: Crawford, Kenneth 
APPLICANT: Zaror, Isabel 
APPLICANT: Innis, Michael 

TITLE OF INVENTION: Pichia Secretory Leader for Protein 
TITLE OF INVENTION: Expression 
NUMBER OF SEQUENCES: 4 0 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Chiron Corporation 
STREET: 4 560 Horton Street 



CITY: Emeryville 

STATE : California 

COUNTRY: United States 

ZIP : 94608 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA : 

APPLICATION NUMBER: US/09/ 02 9 , 267 

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: Chung, Ling-Fong 

REGISTRATION NUMBER: 36,482 

REFERENCE/DOCKET NUMBER: 1165.100 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (510) 601-2704 

TELEFAX: (510) 655-3542 
; INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 3 90 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
MOLECULE TYPE: other nucleic acid 

DESCRIPTION: /desc = "Synthetic" 
US-09-029-267-13 

Query Match 29.9%; Score 161.2; DB 3; Length 390; 

Best Local Similarity 84.6%; Pred. No. 4.8e-44; 

Matches 181; Conservative 0; Mismatches 33; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill lllll II Ml III llllllll lllll Mill lllllllllllllll 

Db 160 GGACCGGAGACGCTCTGCGGGGCTGAGCTCGTGGATGCTCTGCAGTTCGTGTGTGGAGAC 219 

Qy 61 AGGGGCTTTTACTTCAACAAGCCC7\.(^GTCTATGGCTCC 12 0 



Db 



220 




279 



121 



ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 



180 



Db 



280 




339 



QY 



181 



TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTT 214 



340 




Search completed: December 13, 2003, 11:44:49 
Job time : 49.8379 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2 003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table : 



Searched: 



December 13, 2003, 07:29:55 ; Search time 240.655 Seconds 

(without alignments) 
7443.919 Million cell updates/sec 

US-09-852-261-3 
539 

1 ggaccagagaccctttgcgg agtaaacattcccggaattc 539 

I DENTI TY_NUC 

Gapop 10.0 , Gapext 1.0 



2201672 seqs, 1661799599 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



4403344 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Publ ished_Applications__NA: * 

1 : /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq: * 

2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB. seq: * 

3 : /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB. seq: * 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB. seq: * 

5: /cgn2_6/ptodata/2/pubpna/US07__NEW_PUB.seq:* 

6 : /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB . seq : * 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB. seq: * 

8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB. seq: * 

9 : /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB. seq: * 

10 : /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB. seq: * 

11 : /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB. seq: * 

12: /cgn2_6/ptodata/2/pubpna/US09_NEW__PUB.seq:* , 

13 : /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB. seq2 : * 

14 : /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB. seq: * 

15 : /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB . seq : * 

16: / cgn2_6 /p t oda t a / 2 /pubpna /US 1 0_NEW_PUB .seq:* 

17: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*. 

18 : /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB. seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
US-09-852-261-3 

; Sequence 3, Application US/09852261 

; Patent No. US20020083477A1 

; GENERAL INFORMATION: 

; APPLICANT: GOLDS PINK, GEOFFREY 



APPLICANT: TERENGHI , GIORGIO 
TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 
FILE REFERENCE: 117-351 

CURRENT APPLICATION NUMBER: US/09/8 52 , 261 
CURRENT FILING DATE: 2001-05-10 
PRIOR APPLICATION DUMBER: GB 0011278.9 
PRIOR FILING DATE: . 2000-05-10 
NUMBER OF SEQ ID NOS : 14 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 3 
LENGTH: 53 9 
TYPE: DNA 

ORGANISM: Rattus sp . 
US-09-852-261-3 

Query Match 100.0%; Score 539; DB 9; Length 539; 

Best Local Similarity 100.0%; Pred. No. 8.4e-169; 

Matches 53 9; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

I I I I II I I II 1 I I i I I I I I I I II I I I I I I II I I I I I I I I M I I I II I I I II I I I I I I M I 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCT^ 120 

1 1 ; I Mil I ; 1 1 I ; 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 : 1 1 1 1 1 ! 1 1 1 1 1 1 1 ! 1 1 

Db 61 AGGGGCTTTTACTTCAACAAGCCC^CAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 120 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

I I i I : ! 1 1 I i 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill' 1 1 ; 1 1 1 1 1 i 1 1 1 1 1 1 1 ^ 1 1 

Db 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

I I I I II 1 I I I 11 I I I I I I I I I I I I I I I I II I I I I I II I I II I I I I I I I I I II I I I I I I I I 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACA 3 00 

MMMM MM MM MM MM MM MMMMMMMMMMIMM 

Db 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 3 00 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

1 1 1 1 i I M ! I M I 1 1 ! I 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 1 1 ; I ! 1 1 1 1 1 

Db 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 360 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

Mlllllll Mill I INI III Mill I II lllllllllllllllllllllllll Mill 

Db 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCA.CGTCACCGCAAGATCCTTTG 420 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCC^ 480 

1 1 II II II 1 1 1 1 M II 1 1 1 II II 1 1 II II 1 1 1 II 1 1 M II II II 1 1 1 1 1 1 II 1 1 1 1 II II 

Db 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCA^ 480 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTA 53 9 

I II II II 1 1 II II II II M I M M M II II 1 1 1 II I II 1 1 II 1 1 M 1 1 M II I II II 1 1 

Db 481 TC^TTT(^GAGATGGG(^TTTCCCTCAATGAAATACACAAGTAAACATTCCCGGAATTC 53 9 



RESULT 2 



US-10-161-088-1 

Sequence 1, Application US/10161088 
Publication No. US20030077761A1 
GENERAL INFORMATION: 
APPLICANT: Parrow, Vendela 
APPLICANT: Rosengren, Linda 
TITLE OF INVENTION: NEW METHODS 
FILE REFERENCE: 13425-111001 
CURRENT APPLICATION NUMBER: US/10/161,08 8 
CURRENT FILING DATE: 2002-05-31 
PRIOR APPLICATION NUMBER: SE 0101934-8 
PRIOR FILING DATE: 2001-06-01 
NUMBER OF SEQ ID NOS : 3 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 1 
LENGTH: 651 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 
NAME/ KEY: CDS 
LOCATION: (73) . . . (471) 
US-10-161-088-1 

Query Match 84.2%; Score 453.8; DB 15; Length 651; 

Best Local Similarity 92.8%; Pred. No. 2.3e-140; 

Matches 476; Conservative 0; Mismatches 37; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

MINIUM MINI MINIM IIIMM MM II N I II II I II 1 1 II 1 1 II I II 

Db 13 9 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 198 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 120 

II I I II II I I I II II II I I I II I N I I I I I II I I I N I I I II I I I I I I II II I II III 

Db 199 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 2 58 

Qy 121 ACGGG CATTGTGGATGAGTGTTG CTTCCGGAG CTGTGAT CTGAGGAGGCTGGAGATGTAC 180 

II llllll IIIIIMilllMMIIIIIIIIIII lillll I MM MINI 

Db 259 ACAGG CATTGTGGATGAGTGTTG CTTCCGGAG CTGTGAT CTGAGGAGACTGGAGATGTAC 318 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

INI II INI I II INI INI II II INN II III I NIMH I III I 

Db 319 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 378 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAA 3 00 

III MM I WWW I INN I MM MM I IN INN INN II INN I 

Db 379 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 438 

Qy 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAAf^GACCTA 360 

NINIINN II III II INN WWW NINIINN INN III III II WWW 

Db 43 9 AGAAGGAAAGGAAGTAC^TTTGAAGAACACAAGTAGAGGAAGTGCAGGAAAa^AGACCTA 4 98 

Qy 361 CAGAATGTAGGAGGAG C CT CC CGAGGAACAGAAAATG CCACGT CAC CGCAAGAT CCTTTG 420 

I MMMMiiMI MM I II 1 1 1 II 1 1 1 1 1 II I I II 1 1 II I II I II 1 1 1 1 

Db 4 99 (^GAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCAC^TCACCGCAGGATCCTTTG 558 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATC^ 4 80 

MMMM MIMI IMMl I MUM I MM NINII III INN 



Db 



559 CTGCTTGAGCAACCTGCAAAACATCGAAA 618 



Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAA 513 

MM M MMMMMMMI MMMM 

Db 619 ACATTACAAAGATGGGCATTTCCCCCAATGAAA 651 



RESULT 3 

US-09-852-261-11 

; Sequence 11, Application US/09852261 

; Patent No. US20020083477A1 

; GENERAL INFORMATION: 

; APPLICANT: GOLDSPINK, GEOFFREY 

; APPLICANT: TERENGHI , GIORGIO 

; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE - 
; FILE REFERENCE: 117-351 

; CURRENT APPLICATION NUMBER: US/09/852 , 261 

; CURRENT FILING DATE: 2001-05-10 

; PRIOR APPLICATION NUMBER: GB 0011278.9 

; PRIOR FILING DATE: 2000-05-10 

; NUMBER OF SEQ ID NOS : 14 

SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 11 

LENGTH: 4 87 

TYPE: DNA 

ORGANISM: Rattus sp . 
US-09-852-261-11 



Query Match 78.8%; Score 425; DB 9; Length 487; 

Best Local Similarity 90.4%; Pred. No. 7.6e-131; 

Matches 487; Conservative 0; Mismatches 0; Indels 52; Gaps ] 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

I ' 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 ; 1 1 1 1 1 ! i i 1 1 1 ! I ; 1 1 ! M ! 1 1 1 1 1 1 MINI 1 1 1 1 1 1 1 ' I 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 



Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTC 12 0 

MMM MMMMMMMMMMMMMMIMMMMMMMMMMMMI 

Db 61 AGGGGCTTTTACTTCAACAAGCCCAC^GTCTATGGCTCC 12 0 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

1 1 1 1 1 1 1 1 1 i ' 1 1 1 1 1 1 1 1 I ! ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 M M 1 1 ! 1 1 1 1 ! i 1 1 h 1 1 1 

Db 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTC 24 0 

MMMMMMMMMMMMMMMMMMMMMMMMMMM MMM 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 



Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGC 3 00 

MMMMMMMI 

Db 241 ATGCCCAAGACTCAG 255 

Qy 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGT 360 

MMM MMM MMMM MM MM MMM MMMM MMMM Ml 

Db 256 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 3 08 



Qy 



361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 



IMIIMIIIIIIIIMIIIIMIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIII 

Db 3 09 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 368 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 48 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 IN 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 

Db 3 69 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 428 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCCCGGAATTC 53 9 

Mill 1 1 MM I Mill I IMIIIMIIIIIIIIIIIIIIIMI MINI Mill MM 

Db 42 9 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCCCGGAATTC 4 87 



RESULT 4 
US-09-852-261-5 

Sequence 5, Application US/09852261 
Patent No. US20020083477A1 
GENERAL INFORMATION: 
APPLICANT: GOLDSPINK, GEOFFREY 
APPLICANT : TERENGHI , GIORGIO 
TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 
FILE REFERENCE: 117-351 

CURRENT APPLICATION NUMBER: US/ 09/852 , 261 
CURRENT FILING DATE: 2001-05-10 
PRIOR APPLICATION NUMBER: GB 0011278.9 
PRIOR FILING DATE: 2000-05-10 
NUMBER OF SEQ ID NOS : 14 
SOFTWARE: Patent In Ver. 2.1 
SEQ ID NO 5 
LENGTH: 523 
TYPE: DNA 

ORGANISM: Oryctolagus cuniculus 
US-09-852-261-5 

Query Match 66.2%; Score 356.8; DB 9; Length 523; 

Best Local Similarity 82.3%; Pred. No. 4.2e-108; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II Mill MM I Mill MM 1 1 1 1 II 1 1 M 1 1 II I II I M I 

Db 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTAT 12 0 

MUM I MM I Mill II llllll II 1 1 II II M II 1 1 M 1 1 1 1 1 1 1 1 1 1 III 

Db 61 AGGGGCTTTTATTTQ^CAAGCCCACAGGATACGG 12 0 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill III llllll II II IIMil MINI Mill MINI Mill i III 

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

llll ll llllll Mill MM II III MM I J 1 1 1 1 1 1 1 1 1 1 1 1 III 

Db 181 TGTGCACCCCTCAAGCCGGOU^GGCAGCCCGCTCCGTCCGTGCCCAGCGCQ\CACCGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACA 300 

MM 1 1 Ml III I II II 1 1 Mill I III II I II 1 1 II II 1 1 1 1 II Ml 

Db 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAAC^AGAAAATGAAGTCTCAGAGG 3 00 



Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAAC^O^GTAGAGGAAGTGCAGGAAACAAGACCTA 360 

I I III MINIM Ml II 1 1 1 ! 1 1 1 111 II! 1 1 1 E I Mill III II Mill III 

Db 3 01 AGAAGGAAAGGAAGTAt^TTTGAAGAACAC^AGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

III MM MM Mill I Mill I II I II I II Mill II MIMI 

Db 361 CAGGATGTAGGAAGAC CCTTCTGAGGAGTGAAGAAGGACAGGCCACCG CAGGACC CTTTG 42 0 



Qy 421 CTGCTTGAGC^CCTGCAAAAC^TCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 480 

M I I Mill MM III MM II III MM II 

Db 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 474 

Qy 481 TCATTTCAGAGATGGGC^TTTCCCTCAATGAAATACACAAGTAAACATTC 530 

• 1 1 1 1 1 M MM MINIM IMM MIMI MMMIIMI 

Db 4 75 ACATTTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 



RESULT 5 
US-09-852-261-1 

; Sequence 1, Application US/09852261 

; Patent No. US2 0020083477A1 

; GENERAL INFORMATION: 

; APPLICANT: GOLDSPINK, GEOFFREY 

; APPLICANT: TERENGHI , GIORGIO 

; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 

; FILE REFERENCE: 117-351 

; CURRENT APPLICATION NUMBER: US/09/852 , 261 
; CURRENT FILING DATE: 2001-05-10 

PRIOR APPLICATION NUMBER : GB 0011278.9 
; PRIOR FILING DATE: 2000-05-10 
; NUMBER OF SEQ ID NOS : 14 

SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 1 

LENGTH: 517 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-852-261-1 

Query Match 60.3%; Score 325.2; DB 9; Length 517; 

Best Local Similarity 81.2%; Pred. No. 1.4e-97; 

Matches 429; Conservative 0; Mismatches 88; Indels 11; Gaps 4; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 6 0 

Mill Mill II MIMI MUM III MMIMMIMM III 

Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATC 120 

II M 1 1 1 II 1 1 I II 1 1 1 1 1 1 1 1 1 1 II I II I II II I II I II 1 1 II I II 1 1 1 1 I M 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 12 0 

Qy 121 ACGGG CATTGTGGATGAGTGTTG CTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 180 

II Mill I Ml III Ml I I III III MIMI MM II I 1 1 1 1 1 1 1 1 [ 1 1 1 f 1 1 f I 

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCA.GCTCGTTCCATCCGGGCCCAGCGCC^CA.CTGAC 24 0 

III II II III II I MMMIIMI II MM III Ml MIMI II III 

Db 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 24 0 



Qy 241 ATGCCC^GACTCAGAAGTCCCAGCCCCTATCGA(^CACM,GAAAAGGAAGCTGCAAAGG 300 

IMIIMIIII III I I III III II Mill I 1 1 1 1 II I 

Db 241 ATGCCCAAGACCCAGAAGTATC^GCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 2 97 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 

MMIMMIIMIIMI IIIIIIIIIIIIIIMIIM IMIIIIIIIIIIIII III 

Db 2 98 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

III Mill II II III I Mill I II I II I IIIM II MUM 

Db 358 CAGGATGTA - GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416 

Qy 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 4 80 

II I llllll MM III MM II III MM II 

Db 417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 4 70 

Qy 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACAT 528 

I Mill III IMIMIMI I I t I I M : I I I I I 1 I I 

Db 471 ACATTTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517 



RESULT 6 

US-09-919-497-24 

Sequence 24 , Application US/09919497 
Patent No. US20020106662A1 
GENERAL INFORMATION: 
APPLICANT: Mutter, George L. 

TITLE OF INVENTION: PROGNOSTIC CLASSIFICATION OF ENDOMETRIAL CANCER 
FILE REFERENCE: B0801/7225 

CURRENT APPLICATION NUMBER: US/ 09/919 , 4 97 
CURRENT FILING DATE: 2001-07-31 
PRIOR APPLICATION NUMBER: US . 60/221 f 735 
PRIOR FILING DATE: 2000-07-31 
NUMBER OF SEQ ID NOS : 100 
SOFTWARE: Pa tent- In version 3.0 
SEQ ID NO 24 
LENGTH: 7260 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-919-497-24 

Query Match 50.9%; Score 274.2; DB 10; Length 7260; 

Best Local Similarity 76.0%; Pred. No. 5.7e-80; 

Matches 405; Conservative 0; Mismatches 68; Indels 60; Gaps 3; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II Ml llllll I llllll MM I llllll MIMM III 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTC^ 12 0 

MIMIIMM MM llllll II I I ! 1 1 1 1 i 1 1 1 1 Mill! II Ml 

Db 371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 43 0 

Qy 121 ACGGG CATTGTGGATGAGTGTTG CTTC CGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 180 

II Mill MIMM I I i I ! 1 1 M 1 1 : 1 1 1 1 1 1 II 1 1 1 1 II II II I II 1 1 1 1 

Db 431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 4 90 



Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 240 

II I I I lllllll I lllllllllll II I I I I llllllllllllll III 

Db 4 91 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 3 00 

IIIIMIMII III 

Db 551 ATGCCCAAGACCCAG 565 

Qy 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 3 60 

IIIIMIMII lllllllll Ml MUM llllllllllllllll III 

Db 566 AAGGAAGTACATTTGAAGAACG CAAGTAGAGGGAGTG CAGGAAACAAGAACTA 618 

Qy 361 CAGAATGTAGGAGGAG CCTC C CGAGGAACAGAAAATGCCACGT CACCGCAAGAT CCTTTG 420 

III IIIIIIIIII Mill Mill I I II II I lllllll I f 1 1 1 1 1 1 1 

Db 619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678 

Qy 421 - - CTGCTTGAGCAACCTGCAAAACATCGGAA.CACCTGCCAAATATCAATAATGAGTTC^ 4 78 

MM III Mill MM I lllllllll Ml II III MM I 

Db 679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCA AAAAATAAGTTTGA 732 

Qy 479 TATCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

II Mill I MMMM MUM 1 1 ! i 1 1 1 II I M M 1 1 1 1 1 1 i It 1 1 1 

Db 733 TAACATTTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTCC 785 



RESULT 7 

US-09-880-107-3739 

Sequence 3739, Application US/09880107 
Patent No. US20020142981A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Home, Darci T. 
Vockley, Joseph G. 
Scherf, Uwe 
Gene Logic, Inc. 
TITLE OF INVENTION: Gene Expression Profiles in Liver Cancer 
FILE REFERENCE: 44921-5028 -WO 
CURRENT APPLICATION NUMBER: US/ 09/8 8 0 , 107 
CURRENT FILING DATE: 2001-06-14 
PRIOR APPLICATION NUMBER: US 60/211,379 
PRIOR FILING DATE : 2000-06-14 
PRIOR APPLICATION NUMBER: US 60/237,054 
PRIOR FILING DATE: 2000-10-02 
NUMBER OF SEQ ID NOS : 3950 
SOFTWARE: Patent In Ver. 2.1 
SEQ ID NO 3739 
LENGTH: 7260 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 

OTHER INFORMATION: Genbank Accession No. US20020142981A1 X57025 
US-09-880-107-3739 



Query Match 5 0.9%; Score 274.2; DB 10; Length 7260; 

Best Local Similarity 76.0%; Pred. No. 5.7e-80; 

Matches 405; Conservative 0; Mismatches 68; Indels 60; Gaps 



3; 



Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II MMIMIMMMIIMM IIIIIIIIIIIMIIIIIMI 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 12 0 

lilllll I IMIIMM Mill MMIIMIMM MINIM II III 

Db 371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

M IMM 1 1 MM II Ml 1 1 ! 1 1 1 1 1 ! i : 1 1 1 1 i 1 1 1 1 MM 1 1 MM III II 

Db 431 ACAGGCATCGTGGATGAGTG CTG CTTCCGGAG CTGTGATCTAAGGAGG CTGGAGATGTAT 4 90 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

III II II I M II I II I II II 1 1 1 1 II MM II IMMIIMI II III 

Db 4 91 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 ATGCCCMlGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 3 00 

IMIIIMIII III 

Db 551 ATGCCCAAGACCCAG 565 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 360 

II II lilllll lllllllll III 1 1 Mill 1 1 ; I 1 1 1 ! t ! 1 1 1 Ml 

Db 566 AAGGAAGTACATTTGAAGAACG CAAGTAGAGGGAGTG CAGGAAACAAGAACTA 618 

Qy 361 CAGAATGTAGGAGGAGCCTGCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

III II III II I II Mill Mill I I II II I lilllll III III III 

Db 619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678 

Qy 421 --CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAA 4 78 

MM III Mill MM I Mill M III M III 1 1 1 1 I 

Db 679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCA -AAAAATAAGTTTGA 732 

Qy 479 TATCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

II Mill I IMIIIII MUM IMMMMIMMMMIMIIMI 

Db 733 TAACATTTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTCC 785 



RESULT 8 

US-09-873-319-707 

Sequence 707, Application US/09873319A 
Publication No. US20030134324A1 
GENERAL INFORMATION: 

APPLICANT: Munger, William E. 

APPLICANT: Kulkarni, Prakash 

APPLICANT: Getzenberg, Robert H. 

APPLICANT: Waga , Iwao 

APPLICANT: Yamamoto, Jun 

TITLE OF INVENTION: Identifying Drugs for and Diagnosis of Benign Prostatic 
TITLE OF INVENTION: Hyperplasia Using Gene Expression Profiles 
FILE REFERENCE: 44 92 1 -5029-US 
CURRENT APPLICATION NUMBER: US/0 9/873 , 3 19A 
CURRENT FILING DATE: 2001-06-05 
EARLIER APPLICATION NUMBER: US 60/223,323 
EARLIER FILING DATE: 2000-08-07 
NUMBER OF SEQ ID NOS : 755 
SOFTWARE: Patent In Ver. 2.1 
SEQ ID NO 707 



LENGTH : 726 0 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 

OTHER INFORMATION: Genbank Accession No. US20030134324A1 X57025 
US-09-873-319-707 

Query Match 5 0.9%; Score 274.2; DB 13; Length 7260; 

Best Local Similarity 76.0%; Pred. No. 5.7e-80; 

Matches 405; Conservative 0; Mismatches 68; Indels 60; Gaps 3; 



Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

MM! 1 1 1 1 1 1 1 I M Ml MM MM MM M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCC^ 12 0 

MM MM Ml MMMMMMMM 1 1 1 1 1 1 II 1 1 1 1 1 II II II 1 1 1 1 1 1 1 1 

Db 371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 43 0 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill I Ml III II 1 1 III II I II I MUM I MM 1 1 ! I ! I ! 1 1 1 1 i 1 1 1 

Db 431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 4 90 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

Ml II II M II I I II 1 1 ! II 1 1 1 1 II MM MUM Mill II I III 

Db 4 91 TGCGC^CCCCTC^GCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 ATGCCO^GACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 3 00 

MIMMIMI III 

Db 551 ATGCCCAAGACCCAG 565 

Qy 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAAC^^ 360 

MM II III M Mill MM MINIMI llllllllll Mill I III 

Db 566 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

III I Mill II II Mill Mill II II II I II II Ml lllllllll 

Db 619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678 

Qy 421 - - CTGCTTGAGCAACCTGCAAAACATCGG 478 

MM III Mill MM I lllllllll III II III MM I 

Db 679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCA AAAAATAAGTTTGA 732 

Qy 479 TATCATTTCAGAGATGGG CATTT C C CT CAATGAAATA CACAAGTAAACATTCC 531 

II Mill I I I II MM I III II IIIIIIIIIIIIIII IIIIIIIIIM 
Db 733 TAACATTTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTCC 785 



RESULT 9 

US-09-960-706-1066 

; Sequence 1066, Application US/09960706 

; Publication No. US20030134280A1 

; GENERAL INFORMATION: 

; APPLICANT: Munger, William E. 

TITLE OF INVENTION: Identifying Drugs for and Diagnosis of Benign Prostatic 
Hyperplasia Using 

TITLE OF INVENTION: Gene Expression Profiles 



FILE REFERENCE: 44 921 -502 9 - 01US 
CURRENT APPLICATION NUMBER: US/ 0 9/960 , 706 
CURRENT FILING DATE: 2001-09-24 
PRIOR APPLICATION NUMBER : 60/223,323 
PRIOR FILING DATE: 2000-08-07 
PRIOR APPLICATION NUMBER: 09/8 73,319 
PRIOR FILING DATE: 2001-06-05 
NUMBER OF SEQ ID NOS : 1124 
SOFTWARE: Patent In Ver. 2.1 
SEQ ID NO 1066 
LENGTH: 7260 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : f 

OTHER INFORMATION: Genbank Accession No. US2003 0134280A1 X57025 
US-09-960-706-1066 

Query Match . 50.9%; Score 274.2; DB 13; Length 7260; 

Best Local Similarity 76.0%; Pred. No. 5.7e-80; 

Matches 405; Conservative 0; Mismatches 68; Indels 60; Gaps 3; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II MMMMMIMIMMM IMIIIMIIMIMMIMI 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGG 12 0 

MINI I MM MM III II I MUM II I M 1 1 II 1 1 1 1 I II 1 1 1 1 1 1 M II I 

Db 371 AGGGGCTTTTATTTCAACAAGCCC^CAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 43 0 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill II III II Ml I 1 1 1 1 III 1 1 1 1 II 1 1 MIMMMIMIMM 

Db 431 ACAGG CATCGTGGATGAGTGCTG CTTC CGGAG CTGTGATCTAAGGAGG CTGGAGATGTAT 4 90 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTC^GCTCGTTCaiTCCGGGCCCAGCGCCACACTGAC 24 0 

III II I MUM I I Ml 1 1 MM I II MIMMMIMIMM I III 

Db 4 91 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 300 

1111:111 Ml 

Db 551 ATGCCCAAGACCCAG- - 565 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAAG^ 36.0 

IIIIMIIIII IMIIIMI IMMIIIII IMIIMIIIIIIIII III 

Db 566 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 

III I MUM I II Mill II Ml I I M M I Mill II Mill MM 

Db 619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATG CCACCG CAGGATC CTTTG 678 

Qy 421 - - CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATC^ 478 

MM III Mill MM I 1 1 1 1 i 1 1 1 1 Ml II III MM I 

Db 679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCA AAAAATAAGTTTGA 732 

Qy 4 79 TATCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

1 1 Mill I IIIIMM llllll 1 1 1 1 1 1 1 1 1 1 !l I II II I Ml 

Db 733 TAACATTTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTCC 785 



RESULT 10 
US-10-136-639-4 

; Sequence 4, Application US/10136639 
; Publication No. ■ US20030072761A1 
; GENERAL INFORMATION: 
; APPLICANT: LeBowitz, Jonathan 

; TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR TARGETING PROTEINS ACROSS 
THE BLOOD BRAIN 

; TITLE OF INVENTION: BARRIER 
; FILE REFERENCE: SYM-0 08 

; CURRENT APPLICATION NUMBER: US/10/ 13 6 , 63 9 

; CURRENT FILING DATE: 2002-09-06 

; PRIOR APPLICATION NUMBER: US 60/329,650 

; PRIOR FILING DATE: 2001-10-16 

; NUMBER OF SEQ ID NOS : 4 

SOFTWARE: Patent In version 3.0 
; SEQ ID NO 4 

LENGTH: 7260 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-136-639-4 

Query Match 50.9%; Score 274.2; DB 15; Length 7260; 

Best Local Similarity 76.0%; Pred. No. 5.7e-80; 

Matches 405; Conservative 0; Mismatches 68; Indels 60; Gaps 3; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

IMIMII 1 1 II llllllllllllllllllll II I lllllllllll III MM 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 12 0 



Db 




Qy 



121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 



Db 




Qy 



181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 



Db 




Qy 




Db 



Qy 



3 01 AGAAGGAAAGGAAGTAC7VCTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 



Db 




Qy 



3 61 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 



Db 



619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 6 




Qy 



421 



- - CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAA 478 



Db 



679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCA AAAAATAAGTTTGA 732 



Qy 47 9 TATCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

II Mill I MINIM MUM MMIMMIMMMIMMIMM 

Db 733 TAACATTTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTCC 785 



RESULT 11 
US-10-207-655-54 

Sequence 54, Application US/10207655 
Publication No. US20030118592A1 
GENERAL INFORMATION: 
APPLICANT: Ledbetter, Jeffrey A. 
APPLICANT: Hayden-Ledbetter , Martha S. 

TITLE OF INVENTION: BINDING DOMAIN- IMMUNOGLOBULIN FUSION PROTEINS 
FILE REFERENCE: 390069. 401C1 
CURRENT APPLICATION NUMBER: US'/ 10/2 07 , 655 
CURRENT FILING DATE: 2 002-07-2 5 
NUMBER OF SEQ ID NOS : 426 
SOFTWARE: Patent In version 3.0 
SEQ ID NO 54 
LENGTH: 725 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-207-655-54 

Query Match 50.6%; Score 272.6; DB 15; Length 725; 

Best Local Similarity 75.8%; Pred. No. 5.6e-80; 

Matches 404; Conservative 0; Mismatches 69; Indels 60; Gaps 3; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

MM Mill II MMM IMMMMIM 1 1 II 1 1 II II II II I II 1 1 1 1 

Db 156 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 215 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATG 12 0 

III III Mill IIMIMMMIMM MMM M Mill Illlllll! II III 

Db 216 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 275 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

II II II MM MMM MMMMMMI I I II 1 1 1 1 1 1 II 1 1 1 II M 

Db 276 ACAGGTATCGTGGATGAGTGCTG CTTCCGGAG CTGTGAT CTAAGGAGG CTGGAGATGTAT 335 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

III II I MMM I Ml Illlllll II MM I MMM MM III 

Db 336 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 395 

Qy 241 ATGCCGAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGA^ 3 00 

MM, M Ml 

Db 396 ATGCCCAAGACCCAG 410 

Qy 301 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGC^ 360 

1 1 Illlllll I MM Mill I Mill Ml I III Illlllll Mill III 

Db 411 AAGGAAGTACATTTGAAGAACG CAAGTAGAGGGAGTG CAGGAAACAAGAACTA 4 63 

Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 

Ml Ml 1 1 III II Mill Mill I I II II I II I II II I II III Ml 

Db 464 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 523 



Qy 421 - - CTGCTTGAGCAACCTGCAAAAC^^ 478 

1 1 1 1 111 iiiii 1 1 1 1 i 1 1 1 1 1 1 1 1 1 111 ii 111 mi i 

Db 524 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCA AAAAATAAGTTTGA 577 

Qy 4 79 TATCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTCC 531 

II lllll I I II III II 1 1 Ml I I III I Illllllllll Mlllll III 

Db 578 TAACATTTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTCC 63 0 



RESULT 12 
US-09-852-261-13 

Sequence 13, Application US/09852261 
Patent No. US20020083477A1 
GENERAL INFORMATION: 
APPLICANT: GOLDS PINK, GEOFFREY 
APPLICANT : TERENGHI , GIORGIO 
TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 
FILE REFERENCE: 117-351 

CURRENT APPLICATION NUMBER: US/09/852 , 261 
CURRENT FILING DATE: 2001-05-10 
PRIOR APPLICATION NUMBER: GB 0011278.9 
PRIOR FILING DATE: 2000-05-10 
NUMBER OF SEQ ID NOS : 14 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 13 
LENGTH: 471 
TYPE: DNA 

ORGANISM: Oryctolagus cuniculus 
US-09-852-261-13 

Query Match 48.6%; Score 262; DB 9; Length 471; 

Best Local Similarity 74.7%; Pred. No. 1.5e-76; 

Matches 3 96; Conservative 0; Mismatches 75; Indels 59; Gaps 3; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II lllll llllllllllllll MINIMI. MINI III 

Db 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCC^GC^TTCGGAGGGCACCACAG 120 

Mlllllllll MMMMMIIMM II MMIIIMI I IIMIMI III 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCC^GCAGTCGGAGGGCACCTaVG 12 0 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill Mlllll I Ml IIIIMIIIMIIIIIIIMI MMMMIMI 

Db 121 ACAGG CAT CGTGGATGAGTGCTG CTTCCGGAG CTGTGAT CTGAGGAGG CTGGAGATGTAC 18 0 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

MM II 1 1 1 1 1 1 lllll 1 1 1 1 II III 1 1 1 1 llllllllllllll III 

Db 181 TGTGCACCCCT(^GCCGG(^U^GGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAA 3 00 

III llllll III 

Db 241 ATGCCCAAGACTCAG 255 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG(^GGAAACAAGACCTA 36 0 

MM llllll I IMNMIMI Mlllll MMMMMIIMM III 



Db 



256 



AAGGAAGTAOVTTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 308 



QY 



361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 



Db 



3 0 9 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 368 




Qy 



421 CTGCTTGAGCAACCTGCAAAACATCGGAAC^CCTGCCAAATATCAATAATGAGTTCAATA 480 



Db 



369 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 422 




Qy 



481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 53 0 



Db 




RESULT 13 
US-10-251-661-7 

; Sequence 7, Application US/10251661 
; Publication No. US20030166555A1 
; GENERAL INFORMATION: 

APPLICANT: Alberini, Cristina M . 
; APPLICANT: Bear, Mark F. 

; TITLE OF INVENTION: Methods and Compositions for Regulating 

; TITLE OF INVENTION: Memory Consolidation 

; FILE REFERENCE: 3499.1001-003 

; CURRENT APPLICATION NUMBER: US/10/251 # 661 

; CURRENT FILING DATE: 2002-09-20 

; PRIOR APPLICATION NUMBER: 60/193,614 

/ PRIOR FILING DATE: 2000-03-31 

PRIOR APPLICATION NUMBER: PCT/USOl/ 10661 
; PRIOR FILING DATE: 2001-04-02 
; NUMBER OF SEQ ID NOS : 12 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 7 

LENGTH: 612 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 
NAME /KEY: CDS 
LOCATION: (151) . . . (564) 
US-10-251-661-7 

Query Match 41.6%; Score 224; DB 13; Length 612; 

Best Local Similarity 75.6%; Pred. No. 8e-64; 

Matches 316; Conservative 0; Mismatches 50; Indels 52; Gaps 1; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Mill Mill II llllllllllllllllllll llllllll IIIIIIIIIM 

Db 247 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 3 06 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGt^TTCGGAGGGCA 120 



Db 



307 




Qy 



121 



ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 



Db 



367 




Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCC^ 24 0 

Ml II II Mill I MM Mill II II MM II II III M Mill III 

Db 427 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCC^GCGCaVCACCGAC 486 



Qy 241 ATGCCCTUIGACTCAGAAGTCCCA^ 3 00 

MIMIIMM III 

Db 4 87 ATGCCCAAGACCCAG 501 

Qy 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 360 

MIMIIMM MUM III I MUM III II II III II MM I II III 

Db 502 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTG CAGGAAACAAGAACTA 554 



Qy 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTT 418 

Ml MM MM II Mill Mill I I 1 1 II I 1 1 II II I II II 1 1 1 

Db 555 CAGGATGTAGGAAGACCCTC CTGAGGAGTGAAGAGTGACATG CCACCG CAGGAT C CTT 612 



RESULT 14 
US-10-161-088-3 

; Sequence 3, Application US/10161088 

; Publication No. US20030077761A1 

; GENERAL INFORMATION: 

; APPLICANT: Parrow, Vendela 

; APPLICANT: Rosengren, Linda 

; TITLE OF INVENTION: NEW METHODS 

; FILE REFERENCE: 13425-111001 

; CURRENT APPLICATION NUMBER: US/lO/161 , 088 

; CURRENT FILING DATE: 2002-05-31 • 

; PRIOR APPLICATION NUMBER: SE 0101934-8 

; PRIOR FILING DATE: 2001-06-01 

; NUMBER OF SEQ ID NOS : 3 

; SOFTWARE: Fast SEQ for Windows Version 4.0 
; SEQ ID NO 3 

LENGTH: 2 86 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-161-088-3 



Query Match 3 9.7%; Score 213.8; DB 15; Length 28 6; 

Best Local Similarity 89.5%; Pred. No. 1.3e-60; 

Matches 23 0; Conservative 0; Mismatches 27; Indels 0; Gaps 0; 
Qy 16 TGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCAAGGGGCTTTTACTTC 75 

1 1 1 1 1 II II I M M I II 1 1 1 I Mill MIMIIMM III III I Ml III Mill Ml 

Db 18 TGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCGAGGGGCTTTTACTTC 77 

Qy 76 AACAAGCCC^CAGTCTATGGCTCCAGCATTCGGAG 135 

1 1 1 M 1 1 M 1 1 1 1 I I I M i I I I, I Mill hi I 

Db 78 AACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCT(^GACAGGCATTGTGGAT 137 

Qy 136 GAGTGTTG CTTC CGGAG CTGTGATCTGAGGAGGCTGGAGATGTACTGTGT CCG CTG CAAG 195 

I I 'M I I I I ! ,1 ! I IIIIIMIIIIIIMI II III 

Db 138 GAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTACTGTGCCCCACTGAAG 197 



Qy 196 CCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACA 2 55 

I II MM I MM II II Mill 1 1 1 1 1 M 1 1 M 1 1 1 1 1 M M 1 1 1 II 1 1 1 1 1 1 1 1 



Db 198 CCTACAAAAGC^GCCCGCTCTATCra^ 257 

i 

Qy 2 56 AAGTCCCAGCCCCTATC 272 

Mill I II 
Db 258 GCATGCAAGCTTGTCTC 274 
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SEQ ID NO 9 
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Query Match 38.4%; Score 206.8; DB 9; Length 318; 

Best Local Similarity 87.6%; Pred. No. 3e-58; 

Matches 226; Conservative 0; Mismatches 32; Indels 0; Gaps 0; 
Qy 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 6 0 

Mill Mill II 1 1 MINI 1 1 Mill I MM 1 1 1 1 II i II I II II 1 1 1 1 1 1 1 

Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATG 12 0 

II Ml II I Ml II III Ml II Ml III I 1 1 1 1 i I 1 1 1 1 I 1 1 MM 1 i I 1 1 . 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 12 0 

Qy 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill I MUM I Ml MM MUM llllll MM II I II III Mill MM 

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 18 0 

Qy 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGT^ 24 0 

II I II II II 1 1 1 I II 1 1 1 II II 1 1 II MM III I llllll I MM II 

Db 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAG 258 

I llllll! llllll 

Db 241 ATGCCCAAGACCCAGAAG 258 

Search completed: December 13, 2003, 11:56:46 
Job time : 241.655 sees 
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